V per 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




(51) International Patent Classification 5 

C12N 15/57,9/54, O07K3/08 
//(C12N9/54, C12R 1:07) 



Al 



(11) International Publication Number: WO 92/21760 

(43) International Publication Date: 10 December 1992 (10.12.92) 



(21) International Application Number: PCT/US92/04306 

(22) Internationa] Filing Date: 21 May 1992 (21.05.92) 



(30) Priority data: 

706,691 



29 May 1991 (29.05.91) 



US 



(71) Applicant: COGNIS, INC. [US/US]; 2330 Orcadian Way 

Santa Rosa, CA 95407 (US). 

(72) Inventors: CHRISTIANSON, Teresa ; 208 Eucalyptus 

^ n r\? 0t ^\ CA n 94931 (US >' GODDETTE, Dean ; 
806 Carina Circle, Rohnert Park, CA 94928 (US). LAD- 

CA ^^TC^T I. 4 , 83 ? FCrng,en DriVe > Sailta R °* a > 

^ 9 M2» LAU ' Maria ' R * ; 3177 Serra Court 
Fairfield, CA 94533 (US). PAECH, Christian ; 2803 Au^ 
dubon Court, Santa Rosa, CA 95403 (US). REYNOLDS 

^41? S£ rIa . no Santa Rosa > CA 95404 

(US). WILSON, Charles, R. ; 2323 Pacheco Place, Santa 
Rosa CA 95401 (US). YANG, Shiow-Shong ; 1108 Na 
varro Street, Santa Rosa, CA 95401 (US). 



(74) Agent: DRACH, John, E.; Henkel Corporation 140 Ger- 
rnantown Pike, Suite 150, Plymouth Meeting, PA 19002 
(US). 

(81) Designated State: AT (European patent), BE (European 
patent), CH (European patent), DE (European patent), 
DK (European patent), ES (European patent), FR (Eu- 
ropean patent), GB (European patent), GR European 

f ??i'J J Emope * n paten, >> JP > LU (European pa- 
tent), MC (European patent), NL (European patent), SE 
(European patent). 

Published 

With international search report. 
Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: MUTANT PROTEOLYTIC ENZYMES FROM BACILLUS 

(57) Abstract 

ic mutagenesis. The amino add shes seleS for rS^^n Pleases are expressed by genes which are mutated by site-specif, 
pares £ three dimensional 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify State* party to the PCT on the fiont pages of pamphlets publishing international 
application* unriei the PCT. 



AT 


Austria 


Kl 


1* inland 


Ml 


Mult 


All 


Australia 


FR 


France 


MN 


Mongolia 


BB 


Barbados 


CA 


(.jahun 


MR 


Mauritania 


BE 


Belgium 


OB 


U Illicit klllgltulll 


MW 


Malawi 


BF 


Burkina hasu 


CN 


Guinea 


Nl. 


Netherlands 


BU 


Bulgaria 


CR 


Greece 


NO 


Norway 


BJ 


Benin 


HU 


Hungary 


PL 


Poland 


BR 


Bra/il 


IE 


Ireland 


RO 


Rumania 


CA 


C anada 


IT 


Italy 


RU 


Russian Federation 


CF 


Central Afrh.au Republic 


JP 


Sa pan 


SD 


Sudan 


CG 


C ongo 


KF 


Democratic People's Republic 


SE 


Sweden 


CM 


Svw iuci Ltiid 




of kurua 


SN 


Senegal 


*:i 


Cote tl'lvniic 


KR 


Republic dl Korea 


SU 


Soviet Union 


CM 


t 'aineruun 


LI 


I ieclilcnslein 


TD 


Chad 


CS 


CVcchusluvaLia 


IJi 


Sri l*iiiLa 


TO 


Togo 


UE 


(iernuuty 


LU 


Luxembourg 


US 


United Stale* of America 


OK 


Denmark 


MC 


Monaco 






ES 


Spain 


MC 


Madagascar 







9221760A1 I > 



WO^!/21760 



PCIYUS92/04306 



MUTANT PROTEOLYTIC ENZYMES FROM BACILLUS 

pACKGRQUND OF THE INVENTION 

1. Field of the Invention 

This invention relates to mutant proteolytic enzymes 
having improved properties relative to the wild-type 
enzyme, to genetic constructs which code for the mutant 
5 proteolytic enzymes, to methods of predicting mutations 

which enhance the stability of the enzyme, and to methods 
of producing the mutant proteolytic enzymes. 

2. Description of th e Related Art 

Subtilisins are a family of extracellular proteins 
10 having molecular weights in the range of 25,000-3 5,000 

daltons and are produced by various Bacillus species. These 
proteins function as peptide hydrolases in that they 
catalyze the hydrolysis of peptide linkages in protein 
substrates at neutral and alkaline pH values. Subtilisins 
15 are termed serine proteases because they contain a specific 

serine residue which participates in the catalytic 
hydrolysis of peptide substrates, A subtilisin enzyme 
isolated from soil samples and produced by Bacillus lentus 
for use in detergent formulations having increased protease 
20 and oxidative stability over commercially available enzymes 

under conditions of pH 7 to 10 and at temperature of 10 to 
60*C in aqueous solutions has been disclosed in copending 
patent application serial number 07/398,854, filed on 
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mutagenesis, en the other hand, requires a rational 
approach for the introduction of amino acid changes. In 
this approach one or more amino acids may be replaced by 
other residues by altering the DNA sequence which encodes 
the protein. This can be accomplished using 

oligonucleotide directed in vitro mutagenesis. The 
following references teach site-directed mutagenes is 
procedures used to generate specific amino acid 
Lbstitution(s): Hines, J.C.. and Ray, D.S. (1980) Gene 
11:207-216; Zoller, M.J. , and Smith, M. (1982) Nucleic 
Acids Res. 10:6487-6500; Norrander, J., et al. (1983, Gene 
26:101-106; Morinaga, Y., et al. (1984) Bio/Technology 
2:636-639; Kramer, W. , et al. (1984) Nucleic Acids Res. 
12:9441-9456; Carter, P., et al. (1985) Nucleic Acids Res. 
15 13:4431-4443; Kunkel, T.A. (1985) Proc. Natl. Acad. Sci. 

USA 82:488-492; Bryan, P., et al. (1986) Proc. Natl. Acad. 
Sci. USA 83:3743-3745. 

A rational approach may or may not require knowledge of 
a protein's structure. For example, patent application WO 
89/06279 describes the comparison of the primary amino acxd 
sequence of different subtilisins while contrasting 
differences in physical and chemical properties. The 
primary amino acid sequences of the different subtilisins 
are aligned for the greatest homology, while taking into 
account amino acid insertions, deletions, and total number 

of amino acids. 

Currently, the amino acid sequences of at least 10 
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subtilisin proteases have been published. Eight of these 
subtilisins were isolated from species of Bacilli, and 
include subtilisin 168 (Stahl, M.L. , and Ferrari, E. (1964) 
j Bacteriol. 158:411-418), subtilisin BPN • (Vasantha, N. , 
et al., (1984) J. Bacteriol. 159:811-819), subtilisin 
Carlsberg (Jacobs, M. , et al. (1985) Nucleic Acids ^ Res . 
13:8913-8926), subtilisin DY (Nedkov, P., et al. (1985) 
Biol. Chem. Hoppe-Seyler 366:421-430), subtilisin 
amylosacchariticus (Kurihara, H. , et al. (1972) J. -Biol. 
Chem. 247:5619-5631), subtilisin mesenticopeptidase 
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(Svendsen. I., et al. (»..) W ^*' 

subtilisin 147 and subtilisin 309 (Ha.trup ^et .1. 19.9 WO 
89/06279), subtilisin PB92 (Van EeK.len at •»•<»•»» » 

03 8229). and subtilisin BLAP (Ladin, B .t .1. Jl 90) 
society for Industrial Microbiology Annual Meeting. 
Strict P60, . The remaining two subtilisin sequence, are 
th „rmitase from the fungus TAermoactinomyc* vulgaris 
thermitase fro 183 :195-200) , and 

(Meloun . B et a! . <1 ) Tritl „ cMuD aifcuB Jxmber 
!o. K a^aye, B. Biol. chem. Hoppe-Seyler 

"^etlodV'for obtaining optimum alignment of homologous 
proteins are described in Atlas of Protein Sequence end 
structure. Vol. 5, Supplement 2 ,1976, (Dayhof f M O. , ed 
Natl Biomed. P-es. Found., Silver Spring., *>> • This 
prison is then used to identify specific 
alterations which night produce desirable improvement in 
tne target enzyme. Wells, J.A.. «t al. (1987) Proc. Natl. 
" ' sci . usa 84:1219-1223, used primary seouence 
augment to predict site directed mutations 
the substrate specificity of a subtilisin. »»» 
alignment approach WO 89/06279 teaches the construction of 
lllZ subtmsins having improved properties 
increased resistance to oxidation, increased proteolytic 
ac«v"y! and improved washing performance for laundry 
activity, J patent applications WO 89/09819, 

detergent applications. patent «pp ....hi ntv 

and WO 89/09830 teach improvement in the thermal stability 
of subtilisin BPK. by the introduction of one or -or. amino 
acid change, based on the alignment of the primary ammo 
acid seguence. of subtilisin BPK- with the more therm, 
stable subtilisin Carlsberg. From hereon, amino acids will 
L referred to by the one or three letter code as defined 
in Table 1. 
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TABLB 1 

On* and Thrae Latter Cod* for Amino Acids 

- Alanine 

- cysteine 

> Aspartic acid or aspartate 

■ Glutamic acid or glutamate 

- Phenylalanine 
= Glycine 
= Histidine 
= isoleucine 
b Lysine 
b Leucine 

- Methionine 

■ Asparagine 
15 P - Pro - Proline 

Glutamine 
Arginine 
Serine 
Threonine 

20 V * Val - Valine 

Tryptophan 
Tyrosine 

Rational mutational approaches may also predict 
mutations which improve an enzyme property based upon the 

25 three dimensional structure of an enzyme, in addition to 

the alignment of primary amino acid sequences described 
above. One method for determining the three dimensional 
structure of a protein involves the growing of crystals of 
the protein, followed by X-ray crystallographic analysis. 

30 This technique has been successfully used to determine 

several high resolution subtilisin structures such as 
thermitase (Teplyakov, A.V. , et al. (1990) 214:261-279), 
subtilisin BPN' (Bott, R. , et al. (1988) J. Biol. Cham. 
263:7895-7906) and subtilisin Carlsberg (Bode, W. , et al. 

35 (1986) EMBO J. 5:813-818), for example. 

EP 0251446 teaches the construction of mutant carbonyl 
hydrolases (proteases) which have at least one property 
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different from the parental carbonyl hydrolase, 
describes stations which effect (either improve or 
decrease) oxidative stability, substrate specificity, 
catalytic activity, thermal stability, alkaline stability, 
P H activity profile, and resistance to autoproteolysis. 
These mutations were selected for introduction into 
Bacillus amyloliquefaciens subtilisin BPN' after alignment 
of the primary sequences of BPN' and proteases from B. 
subtilis, B. lichsniformis. and thermitase. Such alignment 
can then be used to select amino acids in these 
other proteases which differ, as substitutes for the 
equivalent amino acid in the B . amyloliquefaciens carbonyl 
hydrolase. This application also describes alignment on 
the basis of a 1.8 A X-ray crystal structure of the B. 
amyloliquefaciens protease. Amino acids in the carbonyl 
hydrolase of B. amyloliquefaciens which when altered can 
affect stability, substrate specificity, or catalytic 
efficiency include: MetSO, Metl24, and Met222 for 
oxidative stability; Tyrl04, Alal52, GlulSf, Glyl66, 
Glyl69, Phel89, and Tyr2l7 for substrate specificity; N155 
alterations were found to decrease turnover, and lower Km; 
Asp36, Ilel07, Lysl70, Aspl97, Ser204, Lys213, and Met222 
for alkaline stability; and Metl99, and Tyr21 for thermal 
stability. Alteration of other amino acids was found to 
affect multiple properties of the protease. Included in 
this category are Ser24, MetSO, Aspl56, Glyl66, Glyl69, and 
Tyr217. substitution at residues Ser24, MetSO, Ilel07, 
G1U156, Glyi66, Glyl69, Ser204, Lys213, Gly215, and Tyr217 
was predicted to increase thermal and alkaline stability. 
An important point about this patent application is that 
with the exception of those mutations effecting substrate 
specificity, no rational mutational approach for improving 
the alkaline or temperature stability of a protease based 
upon computer simulations of an X-ray crystal structure is 

35 described. 

WO 88/08028 teaches a method for redesigning proteins 
to increase stability by altering amino acid residues that 
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are in close proximity to the protein's metal ion binding 
site. This application describes the alteration of a 
calcium ion binding site present within subtilisin BPN # 
through the substitution, insertion, or deletion of amino 
5 acid residue (s) in close proximity to that site so that the 

electrostatic attraction between the amino acids and the 
calcium ion is increased. The characterization of the 
calcium ion binding site is accomplished through the 
analysis of a 1.3 A three dimensional structure of 

10 subtilisin BPN* using a high resolution computer graphics 

system. This approach allows the selection of amino acids 
acceptable for replacing the native amino acids in the 
protease by first simulating the change using the computer 
model. This allows for the identification of any problems 

15 including steric hindrance prior to the actual construction 

and testing of the mutant proteases. 

US patents 4908773 and 4 853871 teach a computer based 
method for evaluating the three dimensional structure of a 
protein to select amino acid residues where the 

20 introduction of a novel disulfide bond will potentially 

stabilize the protein. Potentially acceptable amino acid 
residues can then be ranked, and replaced using computer 
simulation, prior to the actual construction of the mutant 
protein using site directed mutagenesis protocols. 

25 Several patent applications combine published data on 

biochemical stability with computer analysis of three 
dimensional protease structures in order to predict 
mutations which stabilize the enzyme. US 4 , 914,031 and WO 
88/08033 and WO 87/04461 teach a method for improving the 

30 pH and thermal stability of subtilisin aprA by replacing 
asparagine residues present in asparagine/glycine pairs. 
Asparagine/glycine pairs in proteins have been shown to 
undergo cyclization to form cyclic imide 
anhydroaspartylglycine (Bornstein, P., and Balian, G. 

35 (1977) Methods Enzymol. 47:132-145). This cyclic imide is 

susceptible to base hydrolyzed cleavage leading to 
inactivation of the enzyme. Computer analysis of the three 
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dimensional structure of the eprA protease also predicted 
that formation of the cyclic i»ide could lead to protease 
inactivation resulting fro. a shift of the side chain of 
the active site serine. The decision to replace the 
5 asparagine residue and not the glycine residue was based 
upon alignment of the aprA sequence with other subtilisin- 
like enzymes, cucumisin and proteinase K. 

Sensitivity to oxidation is an important deficiency of 
serine proteases used in detergent applications (Stauffer, 
10 C.E., and Etson, D. (1969) J. Biol. Chem. 244:5333-5338). 

EP 0130756, EP 0247647, and US 4,760,025 teach a saturation 
mutation method where one or multiple mutations are 
introduced into the subtilisin BPN • at amino acid residues 
Asp32, Asnl55, Tyrl04, Het222, Glyl66, His64 , Ser221, 
Glyl69, G1U156, Ser33, Phel89, Tyr217, and/or Alal52. 
Using this approach mutant proteases exhibiting improved 
oxidative stability, altered substrate specificity, and/or 
altered P H activity profiles are obtained. A method is 
taught in which improved oxidative stability is achieved by 
substitution of methionine, cysteine, tryptophan, and 
lysine residues. These publications also teach that 
mutations within the active site region of the protease are 
also most likely to influence activity. Random or selected 
mutations can be introduced into a target gene using the 
25 experimental approach but neither EP 0130756, EP 0247647, 
nor US 4,760,025 teach a method for predicting amino acid 
alterations which will improve the thermal or surfactant 
stability of the protease. 

WO 8705050 teaches a random mutagenesis approach for 
30 construction of subtilisin mutants exhibiting enhanced 
thermal stability. One or more random mutations are 
introduced into single stranded target DNA using the 
chemical mutagens sodium bisulfite, nitrous acid, and 
formic acid. Subsequently, the mutated DNA is transformed 
35 into a Bacillus host and at least 50,000 colonies are 

screened by a filter assay to identify proteases with 
improved properties. Site directed mutagenesis can then be 
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used to introduce all possible mutations into a sits 
identified through the random mutagenesis screen. No 
method for pre selection of amino acids to be altered is 
taught. 

EP 0328229 teaches the isolation and characterization 
of PB92 subtilisin mutants with improved properties for 
laundry detergent applications based upon wash test 
results. It teaches that biochemical properties are not 
reliable parameters for predicting enzyme performance in 
the wash. Methods for selection of mutations involve the 
substitution of amino acids by other amino acids in the 
same category (polar, nonpolar, aromatic, charged, 
aliphatic, and neutral), the substitution of polar amino 
acids asparagine and glutamine by charged amino acids, and 
increasing the anionic character of the protease at sites 
not involved with the active site. No method for 
identifying which specific amino acids should be altered is 
taught, and no rational mutational approach is taught which 
is based on alignment of X-ray structures of homologous 
20 proteases with different properties. 

EP 0260105 teaches the construction of subtilisin BPN' 
mutants with altered transesterif ication rate/hydrolysis 
rate ratios and nucleophile specificities by changing 
specific amino acid residues within 15 A of the catalytic 
triad. Russell, A.J., and Fersht, A.R. (1987) Nature 
328:496-500, and Russell, A.J., et al. (1987) J. Mol. Biol. 
193:803-813, teach the isolation of a subtilisin BPN' 
mutant (D099S) that had a change in the surface charge 14- 
15 A from the active site. This substitution causes an 
effect on the pH dependence of the subtilisin's catalytic 
reaction. 

There are a number of different strategies for 
increasing protein stability. Many of these methods suggest 
types of substitutions to improve the stability of a 
protein but do not teach a method for identifying amino 
acid residues within a protein which should be substituted. 
From entropic arguments, many types of substitutions have 



25 



30 



35 



SUBSTITUTE SHEET 



WO 92/21760 



PCT/US92/04306 



10 



been suggested such as Gly to Ala and any amino acid to Pro 
(Matthews, B.V., et al. (1987) Proc. Natl. Acad. Sci. 
84:6663-6667). Likewise, while it is clear that increasing 
the apolar size of an amino acid in the core will add to 
5 stability, adverse packing effects may more than compensate 
for the hydrophobic effect, resulting in a decrease in 
protein stability (Sandberg,W.S. , and Terwilliger, T.C. 
(1989) Science 245:54-57) . Henendez-Arias, L. , and Argos, 
P. (1990) J. Mol. Biol. 206:397-406, performed a 

10 statistical evaluation of amino acid substitutions of 
thermophilic and mesophilic molecules and proposed that 
decreased flexibility and increased hydrophobicity in the 
a-helical regions contributes most towards increasing 
protein stability. From their data, they formulated a set 

15 of empirical rules to improve stability. 

increasing the hydrophobicity of certain side chains 
has long been suggested as a means to improve protein 
stability. The hydrophobic exclusion of nonpolar amino 
acids is the largest force driving protein folding. This 

20 has been studied by examining the partitioning of amino 
acids or amino acid analogs from water to a hydrophobic 
medium. While the numbers vary depending on the work, 
these studies generally agree that burying a hydrophobic 
side chain increases protein stability. For example, 

25 Kellis, J.T., Jr., et al. (1988) Nature 333:784-786, 
estimated that the removal of a methyl group destabilizes 
the enzyme by 1.1 kcal/mole assuming no other structural 
perturbaticns occur. Conversely, this predicts that the 
addition of a methylene group should add 1.1 kcal/mol if no 

30 unfavorable contacts occur. Similarly, Sandberg, W.S., and 
Terwilliger, T.C. (1989) Science 245:54-57, showed that the 
effect of removing or adding methylene groups is the sum of 
the hydrophobic effect and structural distortions. Simply 
adding buried hydrophobic groups may not increase protein 

35 stability because the total effect of adding or deleting a 

methyl group on the local packing structure must be 
considered. As the protein interior has a para-crystalline 



- 9221760A1 - L> ™ "-rrr 



WO 92/21760 



PCT/US92/04306 



11 

structure (Chothie, C. (1975) Nature 254:304-308), small 
distortions in the remainder of the structure resulting 
from the addition methyl group may exact a high cost and 
reduce rather than increase stability. 
5 Along the same lines, the core of X repressor has been 

shown to be amazingly tolerant to apolar amino acid 
substitutions in a functional assay (Bowie, J.U., et al. 

(1990) Science 247:1306-1310). It is not clear that this 
is true for larger proteins. The constraints on the 

10 hydrophobic core of a small protein may be less stringent 
than a larger protein simply due to the volume of the core 
relative to the number of amino acids which need to pack 
into the region. As the volume of the hydrophobic core 
increases, the number of amino acids which must pack 

15 together correctly increases, requiring more specific 

nonlocal interactions. 

It has been recognized that increasing the interior 
hydrophobicity of a protein as a means of increasing the 
stability is hampered by the difficulty of determining 

20 which positions in the protein will lead to stabilization 

when substituted (Sandberg, W.S., and Terwilliger, T.C. 

(1991) Trends Biotechnol. 9:59-63). The methods discussed 
above provide a means of determining what substitutions to 
make to improve stability but do not identify which sites 

25 in the protein are most important. The present invention 

provides a method of determining which positions in the 
protein will lead to stabilization when substituted. 

SUMMARY OP THE INVENTION 
Other than in the operating examples, or where 
30 otherwise indicated, all numbers expressing quantities of 
ingredients or reaction conditions used herein are to be 
understood as modified in all instances by the term 
"about". 

The native or wild* type protease from which the mutant 
35 proteases according to the invention are derived is a B. 

lent us alkaline protease (BLAP) obtained from fi« lentus 
DSM 5483 having 269 amino acid residues, a molecular mass 
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of 26,823 daltons, and a calculated isoelectric point of 
9.7 based on standard pK values. The BLAP gene is obtained 
by isolating the chromosomal DMA from the B. lentus strain 
DSM 5483, constructing DNA probes having homology to 
5 putative DNA sequences encoding regions of the B . lentus 

protease, preparing genomic libraries from the isolated 
chromosomal DNA, and screening the libraries for the gene 
of interest by hybridization to the probes. 

Mutant B. lentus DSM 5483 proteases have been made 

10 which are derived by the replacement of at least one amino 

acid residue of the mature form of the B. lentus DSM 5483 
alkaline protease. The sites for replacement are selected 
from the group consisting of Ser3, Val4, Ser36, Asn42, 
Ala47, Thr56, Thr69, Glu87, Ala96, AlalOl, Ilel02, Serl04, 

15 Asnll4, HisllS, Alal20, Serl30, Serl39, Thrl41, Serl42, 

Serl57, Alal88, Vall93, Vall99, Gly205, Ala224, Lys229, 
Ser236, Asn237, Asn242, His24 3, Asn255, Thr268. The 
replacement amino acid residues are listed in Table 2. The 
numbering of the mutant proteases is based on the 

20 B. lentus DSM 5483 wild-type protease as given in the SEQ 

ID NO: 52. 

Genes which express the mutant B. lentus DSM 5483 
proteases according to the invention are made by altering 
one or more codons of the wild-type B. lentus DSM 5483 

25 alkaline protease gene which encode for a protease derived 

by accomplishing at least one of the amino acid 
substitutions listed in Table 2. 

The protease sites listed in Table 2 are sites 
predicted to affect thermal and surfactant stability 

30 relative to the wild-type protease. These sites are 

identified by means of a computer based method which 
compares the three dimensional structure of the wild-type 
protease (henceforth, the target protein) and a homologous 
protease (henceforth, the reference protein) . The three 

35 dimensional coordinates of the wild-type protease are 

probed with an uncharged probe molecule to produce a probe- 
accessible surface which has an external surface the 
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interior of which contains one or more probe-accessible 
internal cavities. The amino acids of the reference protein 
having side chains lying outside the solvent-accessible 
surface or inside the internal cavities of the target 
5 protein are identified by aligning the three dimensional 
coordinates of the target protein and the reference 
protein. 

Proteins having greater thermal and surfactant 
stability are produced by replacing the amino acid in the 
10 target protein if the amino acid in the target protein can 
be changed without creating unacceptable steric effects. 
The amino acid in the target protein is altered by site 
directed mutagenesis of the gene which expresses the target 
protein. 

15 Genetic constructs are made which contain in the 

direction of transcription a promoter, ribosomal binding 
site, initiation codon and the major portion of the pre 
region of the Bacillus licheniformis ATCC 53926 alkaline 
protease gene operably linked to a portion of the pre 

20 region and all of the pro and mature regions of the 

Bacillus lentus DSM 5483 alkaline protease gene followed by 
a 164 bp DNA fragment containing the transcription 
terminator from the ATCC 53926 alkaline protease gene. The 
Bacillus lentus DSM 5483 alkaline protease gene is altered 

25 to produce a mutant gene which encodes for a protease 
derived by accomplishing at least one of the amino acid 
substitutions listed in Table 2. Mutant protease is made by 
fermenting a Bacillus strain transformed with a genetic 
construct containing a mutated Bacillus lentus DSM 5483 

30 alkaline protease gene. 

BRIEF DE8CRIPTIOH OF THE FIGURES 

Figure 1 shows the atomic coordinates for Bacillus 
lentus alkaline protease (BLAP) to 1.4 A resolution. 

Figure 2 shows the restriction map for plasmid pCB13C 
35 which contains a hybrid gene fusion between the Bacillus 

licheniformis ATCC 53926 protease gene and the Bacillus 
lentus DSM 5483 BLAP gene. The promoter, ribosomal binding 
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site and presence (P-53926, fro. ATCC 53926 were fused 
to the pro- and mature sequence of the BLAP gene. The 
p ^ . nf XTCC 53926 (T-53926) was 

transcription terminator of ATCC i 

appended to the BLAP coding region. 

Figure 3 shows the restriction »ap for plasmid P Mcl3C 
which is derived from pMac5-8 and contains the BIAP gene 
and carries an amber mutation in the Ap* gene which renders 

it inactive. 

BBBCSIPTIOM OT THB FRErelUtBD EMBODIMENTS 
- n one aspect of the invention relates to mutant 

proteolytic ensymes which have superior thermal stability 
Ld surfactant stability relative to the wild-type protease 
„ determined by laboratory tests. The mutant ■«*••"> 
according to the invention are those derived by the 
replacement of at l«st on. amino add residue of the 
mature Bacillus lentus DSM 5483 alXaline protease wherein 
said one amino acid residue which is selected from the 
group consisting of Ser3, van. S.r36, Asn42, Ala47 ThrW. 
Thr«9, Glu87, A1.96, AlalOl, IU102. Serl04, Asnll4. 
Hi.118. A1.120, Serl30. Serl39, «rl4l. Serl42, Serl57 
Mal88, V.11S3. V.1199, ^05, A1.224. Ly.229, S.r236 
Asn237. Asn242, Hi.243 , Asn255. Thr288 is replaced with the 
emino acid residue, listed in Table 2. Table 2 shows the 
identity and position of the wild-type amino acid and the 
amino acid residue (s) which replace . it in the mutant 
protein. Fcr example, the first entry in Table 2 show. 
Ser3. a .erine residue at position 3 which can be replaced 
by threonine (abbreviated as T using the one letter code 
for amino acids) or any small amino acid. A small amino 
acid is defined as glycine, alanine, valine, serine, 
threonine or cysteine. A small hydrophobic amino acid is 
defined as glycine, alanine, threonine, valine or 
isoieucine. A charged amine acid is defined as lysine, 
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arginine, histidine, glutamate or aspartate. The 
abbreviation a. a. stands for "amino acid" residue. 

TABLE 2 

Residue Replacement Amino Acid 

5 Ser3 T or any small, hydrophobic a. a. 

Val4 I« S or any small a. a. 

Ser36 A, T or any small a. a. 

Ser42 F, A, T, V, I, Y 

Ala47 W or any small a. a. except A 

10 Thr56 V, S or any small, hydrophobic a. a. 

Thr69 R, A or any charged a. a. 

Glu87 R, M or any charged a. a. 

Ala96 I, N, S or any small, hydrophobic a. a. 

AlalOl T, S or any small, hydrophobic a. a. 

15 Ilel02 W or any small a. a. except P 

Serl04 T or any small, hydrophobic a. a. 

Asnll4 S, Q or any small, hydrophobic a. a. 

Hisll8 F or any a. a. except P and W 

Alal20 V or any small, hydrophobic a. a. 

20 Serl30 A, T or any small, hydrophobic a. a. 

Serl39 A, T, Y or any a. a. except P and W 

Thrl41 W or any a. a. except P 

Serl42 A, T or any small, hydrophobic a. a. 

Serl57 T or any small, hydrophobic a .a. 

25 Alal88 P or any small, hydrophobic a. a. 

Vall93 M or any small, hydrophobic a.a. 

Vall99 I or any small, hydrophobic a.a. 

Gly205 V or any small, hydrophobic a.a. 

Ala224 V or any small, hydrophobic a.a. 

30 Lys229 W or any a.a. except P 

Ser236 A, T or any small, hydrophobic a.a. 

Asn237 A, N, Q, M or any small, hydrophobic a.a. 

Asn242 A, N, Q, M or any small, hydrophobic a.a. 

His243 A, N, Q, M or any small, hydrophobic a.a. 

35 Asn255 P or any small, hydrophobic a.a. 

Thr268 V or any small, hydrophobic a.a. 
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The amino acid sequences of the preferred proteolytic 
enzymes are given in SEQ ID N0:1 to SEQ ID NO: 51. The 
preferred mutated B. lentus DSM 5483 proteases which ar« 
encoded for by genes according to the invention as 
disclosed above are given in SEQ ID NO: 53 to 105. These 
proteases are produced by bacterial strains which have been 
transformed with plasmids containing a native or hybrid 
gene, mutated at one or more nucleotide base pairs by known 
mutagenesis methods. These mutant genes encode for 
proteases in which selected amino acid residues have been 
substituted for by other amino acids. 

The mutant proteases according to the invention are 

listed in Table 3. 

Table 3 

Mutation T cmrw,r.t u rt stability SpS Stability 

50"C, 60»C, pH 10. S, pH 8.6, 

pH 11.0 pH 10.0 50*C 50«C 
t»j (min) t«j (Bin) t>j (min) tJj(min) 



S3T, V4I, A188P, V193K, V199I 


120 


67 


3.2 


12 


S3T t A188P, V193H, V199I 


95 


60 


3.75 


16.5 


V4I, A188F, V193K, V199I 


72 


39 


1.75 


3.75 


S139Y, A188P, V193K, V199I 


69 


■ 33 


1.4 


4.6 


S130T, S139Y, A188F, V193M, V199I 


64 


22 


2 


6.3 


A188P, V193M, V199X 


55 


23.5 


3.0 


12.5 


S3T, A188P, V193M 


54 


21 


1.5 


3.4 




52 


17.5 


1.2 


0.95 


S157T 




7.25 


A188P, V193M 


50 


27 


2.5 


A188P 


48 


19 


1.4 


2.8 




3.7 


S3T, V4I, A188P, V193M 


43 


21 


1.4 


V193M 


42 


16.6 


1.2 


3.0 




1.8 




42 


8 


1.0 


S104T 








T69V 


41 


12.3 


0.8 


1.8 


V4I, A188P r V193M 


40 


19 


1.25 


2.7 


A224V 


39 


15 


0.9 


1.1 




2.0 


V199I 


38.5 


11.6 


1.0 


V4I 
53T 


32.5 


10 


0.75 


1.0 


32 


6.6 


1.2 


2.8 


S139Y 


26 


8.8 


1.0 


2.0 




1.9 


N242A 


26 


7.4 


0.9 


1.0 


2.0 


S236T 


25.5 


8.4 




1.8 


S36A 


23.8 


8.6 


0.9 
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TABLE 4 (cont. ) 



Mutation Temperature Stability SOS Stability 

50*C, «0»C, pH 10.5, pH 8.6, 

pH 11.0 pH 10.0 50»C 50*C 
t% (aln) t»j (Bin) t>> (rain) th (Bin) 





H243A 


23 


5.9 


0.8 


1.7 




A101T 


23 


4.7 


0.5 


2.75 




S236A 


23 


5.1 


0.8 


1.3 




E87R 


22.5 


9.0 


0.4 


1.2 


5 


N114S 


22 


7.9 


1.1 


1.3 




A47W 


21 


7.2 


0.9 


1.05 






20.5 


8.4 


0.9 


1.4 




fpci;V 


20 


8.5 


0.8 


0.7 




A1«VV 


20 


11.8 


0.65 


1.9 


X w 




20 


6.6 


1.1 


2.8 




e \ 30X 

0 UUA 


20 


8.8 


0.4 


1.0 




MJUi 


20 


7.2 


0.4 


1.1 






19 


12 


1.0 


1.4 




S104T, S139Y, A224V 


16 


9.5 


1.0 


1.8 


15 


S139A 


16.5 


7.8 


0.5 


0.8 




S142T 


17.5 


11.5 


0.9 


1.7 




S139T 


16.5 


4.3 


0.5 


0.8 




I102W 


16.5 


7.2 


0.7 


1.6 




A96N 


16 


6 


0.9 


0.95 


20 


N42P 


16 


5.9 


1.0 


1.4 




SX42A 


16 


9 


1.0 


1.7 




Hiier 


15.8 


5.1 


1.0 


1.3 




N237A 


15 


7.8 


0.67 


1.3 




N255P 


15.0 


5.3 


1.2 


1.25 


25 


T141W, N237A 


14 


5.4 


0.33 


1.1 




T268V 


14 


3.8 


0.75 


1.1 




K229W 


13.4 


4.6 


1.0 


1.4 




T141W 


12 


6.5 


0.6 


1.4 




wildtype 


12.0 


3.0 


0.8 


1.6 



Any of the proteases listed in Table 3 will exhibit greater 
stability in some manner than the wild-type protease BIAP. 
The entries under the "Mutation" heading of Table 3 shows 
the identity of the wild-type amino acid (using the one 
35 letter code), its position, and the amino acid which 
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replaces it in the mutant protease. For example, S3T 
signifies that the serine at position 3 of the mature 
protease is replaced with a threonine. Some of the 
preferred mutant proteases are single replacements at 
5 specific locations such as a protease wherein valine at 
position 4 is replaced by isoleucine to specific 
combinations of replacements such as a protease wherein 
threonine at position 141 is replaced by tryptophan and 
asparagine at position 237 is replaced by alanine. The 

10 latter protease containing two replacements is one of only 

a number of possibilities. 

The preferred mutant proteases according to the 
invention are identified as: (S3T, V4I, A188P, V193M, 
V199I); E87R; (S3T, A188P, V193M, V199I) ; N114S; (V4I, 

15 A188P, V193M, V199I); A47W; (S139Y, A188P, V193M, V199I) ; 

A120S; (S130T, S139Y, A188P, V193M, V199I) ; T56V; 
A120V; (A188P, V193M, V199I) ; G205V; (S3T, A188P, V193M) ; 
S130A; S130T; S157T; A96I; (S104T, S139Y, A224V) ; S139A; 
S142T; S139T; I102W; V193M; A96N; N42F; S142A; H118F; 

20 N237A; N255P; (T141W, N237A) ; T268V; K229W; T141W; (A188P, 

V193M) ; V4I; S3T; S139Y; N242A; S236T; S36A; H243A; A101T; 
S236A; A188P; <S3T, V4I, A188P, V193M) ; V193M; S104T; T69V; 
(V4I, A188P, V193M); A224V; V199I. The system used to 
designate the above preferred proteases first lists the 

25 amino acid residue in the mature form of the B. lentus DSM 
5483 alkaline protease at the numbered position followed by 
the replacement amino acid residue using the one letter 
codes for amino acids. For example, V193M is a protease in 
which valine has been replaced by methionine at position 

30 193 of the mature B . lentus DSM 5483 alkaline protease. A 

mutant protease identified by more than one such 
designation is a mutant protease which contains all of the 
indicated substitutions. For example, (A188P, V193M) is a 
protease in which valine has been replaced by methionine at 

35 position 193 of the mature B . lentus DSM 5483 protease and 

alanine at position 188 has been replaced by proline. 
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Mutant forms of the B. lentus DSM 5483 alkaline 
protease are prepared by site-specific mutagenesis of DNA 
encoding the mature form of either wild-type BLAP, or a 
mutant BLAP. The DNA fragment encoding the mature form of 
5 wild type BLAP was prepared using plasmid pCB13C. Plasaid 
pCB13C contains a hybrid fusion between the 
B . llchenilormis ATCC 53926 protease gene and the B. lentus 
DSM 5483 BLAP gene, shown in Figure 2. Specifically, this 
hybrid fusion contains DNA encoding the promoter, ribosomal 

10 binding site, and 21 residues of the pre sequence from the 

ATCC 53926 protease gene fused to a DNA sequence encoding 
the last five residues of the BIAP pre sequence and all of 
the pro and mature residues of BLAP. This fusion is 
referred to as the Cial fusion because this restriction 

15 site is located at the juncture between the ATCC 53926 and 

DSM 5483 DNA 1 s . A new Clal restriction site had to be 
introduced into the ATCC 53926 alkaline protease gene near 
to the junction of the pre and pro sequences. The Clal 
site was introduced into the ATCC 53926 alkaline protease 

2 0 gene by using a polymerase chain reaction (PCR) to amplify 

a DNA fragment containing sequence information from the N- 
terminal part of the ATCC 53926 alkaline protease gene. 
The amplified fragment included the ATCC 53926 alkaline 
protease promoter, ribosomal binding site, initiation 
25 codon, and most of the pre sequence* This 292 bp DNA 

fragment was flanked by Aval and Clal restriction sites at 
its 5 1 and 3 1 ends, respectively. The BIAP gene already 
contained a naturally occurring Clal site at the 
corresponding position. Analysis of the DNA sequence 

3 0 across the fusion of the ATCC 53926 and BIAP genes 

confirmed the expected DNA and amino acid sequences. 

Before any mutagenesis can be carried out, the gene is 
subcloned into the mutagenesis vector pMa5-8. This is 
accomplished by synthesizing a DNA fragment containing the 
35 Clal fusion gene and the ATCC 53 926 transcription 

terminator as a Sail cassette using the PCR. The PCR was 
carried out using conditions as described by the 
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manufacturer (PerKin Elmer Cetus, NorwalK, CT.). In the 
PCR two synthetic oligonucleotides bearing Sail Bite, are 
used as primers and Escherichia coli vector pCBHC DNA as 
a template. After cutting the PCR product with Sail, this 

5 fragment is cloned into the mutagenic plasmid pMc5-8 which 

has previously been cut with Sail and dephosphorylated with 
bacterial alkaline phosphatase. Plasmids pMc5-8, and P Ma5- 
8 described below were obtained from H.-J. Fritz and are 
described by Stanssens, P., et al. (1989) Nucleic Acids 

10 Res. 17:4441-4454. Sail sites are chosen to allow the PCR 

fragment to be cloned into P Mc5-8 in both orientations. The 
ligation mix is transformed into E. colx WK6. 
Chloramphenicol resistant (Cm*) transf ormants are screened 
for the presence of an insert and a correct plasmid 

15 construct P Mcl3C is identified as shown in Figure 3. Once 

the gene is cloned into the pMc vector and desirable sites 
for mutation are identified, the mutation(s) is introduced 
using synthetic DNA oligonucleotides according to a 
modification of a published protocol (Stanssens, P., et al. 

20 (1989) NUCleic Acids Res. 17:4441-4454). The 

oligonucleotide containing the mutation (s) to be introduced 
is annealed to a gapped duplex (gd) structure which carries 
the BLAP gene on a segment of single stranded (ss) DNA. 
The gapped duplex can be formed by annealing linear ss DNA 

25 from PMC13C with denatured and restricted pMa5-8 DNA. 

Plasmid pMa5-8 contains an active ampicillin resistance 
gene but has an inactivating point mutation in the 
chloramphenicol resistance gene, whereas plasmid P Mcl3C 
contains, in addition to an intact BLAP gene, an active 

30 chloramphenicol resistance gene, but has an inactivating 

point mutation in the ampicillin resistance gene. The 
annealed product is the gd DNA which is a double stranded 
heteroduplex with a ss DNA gap spanning the entire cloned 
BLAP gene. The mutant oligonucleotide is able to anneal to 

35 homologous ss BLAP DNA within the gap and the remaining gap 
is filled in by DNA polymerase I (Klenow fragment) and 
ligated using T4 DNA ligase, purchased from New England 
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Biolabs Inc., Beverly, Ma. The mutagenic efficiency of 
such a system can be improved by the use of Exonuclease III 
(Exo III) purchased from New England Biolabs Inc., Beverly, 
MA. Exo III is an exodeoxyribonuclease that digests 
5 double stranded DKA from the 3 1 end. As a free 3 1 end is 
required, closed circular ss DNA or ds DNA is unaffected by 
this enzyme. A subsequent treatment of the product of the 
fill-in reaction with Exo III removes any species with only 
partially filled gaps. This significantly improves the 

10 mutagenic efficiency and is the preferred mutagenesis 
method. The product of the fill-in reaction is then 
transformed into a repair deficient E* coli strain such as 
WK6mutS and ampicillin resistant transf ormants (Ap R ) are 
selected • Replication of the transformed heteroduplex 

15 phasmid results in two different progenies. One progeny 

contains the wild type BLAP gene and the intact 
chloramphenicol resistance gene, but an inactive ampicillin 
resistance gene* The other progeny contains a BLAP gene 
carrying the mutation of interest and is resistant to 

20 ampicillin but not to chloramphenicol. 

Selection of Ap R , Cm s mutant transf ormants with 
ampicillin is not sufficient to stop some background growth 
of the Ap s , Cm R progeny carrying the wild type BLAP gene. 
Therefore, it is necessary to perform a second 

25 transformation into E. coli using plasmid DNA prepared from 

the Ap R transformers of the WK6mutS strain. This second 
transformation uses a low plasmid concentration with a 
large number of recipient cells of a suppressor deficient 
strain of E. coli each as WK6. This approach decreases the 

30 likelihood of a recipient cell receiving plasmid DNA from 

both progeny. Ap B transf ormants are selected and plasmid 
DNA from several transf ormants is isolated and screened for 
the presence of the mutation. The pMa mutant derivative of 
the first mutagenesis round can be used for a second round 

35 of mutagenesis by preparing ss DNA of that species and 

annealing it to Xbal/HindUI restricted and denatured DIJA 
of pMc5-8. Plasmid pMc5-8 is identical to pMa5-8 except 
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that It contains an active chloramphenicol resistance gene 
and an inactive ampicillin resistance gene. The general 
procedure is the sane as that described above. 

The mutant BLAP proteases can be produced by 
5 transferring the mutant BLAP genes from their particular I. 
coli pMal3C derivative vector into a plasmid vector which 
can replicate in Bacillus. To accomplish this, the mutant 
BLAP genes are separated from their pMal3C plasmids by 
digestion with the restriction endonucleases Aval and Sstl, 

10 followed by ligation to the larger Aval/SstI fragment from 

either plasmid pH70 or pC51. These Aval/SstI fragments 
from pH70 and pC51 include the DNA sequences necessary for 
replication in Bacillus and encode either Jcanamycin 
resistance (Km R ) or tetracycline resistance (Tc"), 

15 respectively. Plasmid pH70 is constructed by cloning the 
ATCC 53926 alkaline protease gene carried on a BcoRI/BamHI 
DNA fragment into the Km R plasmid pDBHO between the BcoRI 
and BamHI sites. Plasmid pC51 is constructed by cloning 
the ATCC 53926 protease gene carried on a ZcoRI-BamHI 

20 fragment into the Tc R plasmid pBC16 between the BcoKL and 

BamHI sites. The larger Aval -Sstl fragment from either 
pH70 or pC51 used for cloning the mutant BLAP genes is 
first purified from other DNA fragments by high pressure 
liquid chromatography (HPLC) on a Gen-Pak FAX column 

25 (Waters, Milford, MA). The column is 4.6 mm by 100 mm in 

size and contains a polymer-based high performance anion- 
exchange resin. Conditions for elution of the DNA are a 
flow rate of 0.75 ml/min with a gradient of Buffer A (25 
mH tris(hydroxymethyl)aminomethane (Tris) pH 8 . 0 containing 

3 0 1 mM disodium ethylenediamine tetraacetic acid (EDTA) ) and 

Buffer B (25 mM Tris pH 8.0, 1 mM EDTA, 1 M NaCl) starting 
at 50% each and reaching a final concentration of 30% 
Buffer A and 70% Buffer B. 

After ligation the mutant BLAP plasmids are 

35 transformed into B. subtilis DB104. The genes encoding 
the major alkaline and neutral proteases present in this 
strain have been inactivated (Kavamura, P., and Doi, R.A. 
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(1984) J. Bacteriol. 160:442-444). Cell! Of B. BUbtiliB 
DB104 transformed by these plasmids grow on a nutrient-skim 
■ilk agar in the presence of either kanamycin or 
tetracycline. Trans formants of DB104 that manufacture 
5 mutant protease are identified by the formation of clear 
zones of hydrolysis in the skim milk, confirmation that the 
protease-producing transf ormants carry a plasaid-borne BIAP 
gene with the desired mutation (s) is accomplished by 
purifying plasmid DNA from a culture of each transf ormant. 
10 The plasmid DNA is purified away from cell protein and 
chromosomal DNA by SDS-salt precipitation followed by 
chromatography over a Qiagen ion-exchange column (Qiagen 
Corporation, Studio City, CA) . Aval-SstI digested plasmid 
DNAs from different transf ormants are compared with 
15 Aval /Ss t 1 -digested derivatives of plasmid pH70 or pC51 

known to carry an intact BLAP gene. Restriction digests of 
these plasmids are compared by agarose gel electrophoresis 
to identify plasmids that have the proper-sized Aval/SstI 
DNA fragments. Selected plasmid DNAs are then sequenced 
20 across the region of the expected BIAP nutation (s) to 

confirm that the desired mutation (s) are present. One or 
more clones of each BIAP mutation are stored frozen in 15% 
glycerol at -70"C and also cultivated in shake flasks 
(Example 4, Production of Proteases) to produce mutant 
25 protease for characterization. 

Another aspect of the invention provides a computer 
based method for identifying the sites which affect the 
storage, thermal, SDS and pH stability of a protein. This 
method is based on the hypothesis that protein stability 
30 may be enhanced by decreasing tne volume of internal 
cavities and improving surface packing of amino acid side 
chains. The interior of a protein contains many apolar 
amino acids which are tightly packed into a nearly 
crystalline state. One way in which these interior amino 
35 acids affect protein stability is through packing effects. 

These include van der Waal interactions, distortion of the 
remainder of the protein and electrostatic effects. 
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target protein coordinates with an uncharged probe molecule 
having a radius of about 0.9 to about 2.0 A. It is 
important that no water molecules be included in the 
protein structure during this analysis. The second step of 
this method is the identification of the amino acids which 
form the boundaries of the internal cavities. These amino 
acids comprise a set of positions which, if mutated,, may 
increase the stability of the protein. An increase in 
stability can be achieved by amino acid substitutions which 
decrease the volume of the internal cavities. 

The molecular modeling program QUANTA (trademark of 
Polygen Corporation, 200 Fifth Ave., Waltham, MA 02254) was 
used to calculate probe-accessible surfaces as well as 
perform the alignment of the three dimensional coordinates 
of the proteins. These functions can be carried out 
equally well by other molecular modeling programs which are 
also commercially available. The following is a list of 
commercially available programs which can also be used to 
calculate probe-accessible surfaces: Insight or Insightll 
20 (trademark of Biosym Technologies, Inc., 10065 Barnes 

canyon Road - Suite A, San Diego, CA 92121), BIOGRAF 
(trademark of Biodesign, Inc., 199 S. Los Robles Ave., 
#270, Pasadena, CA 91101) or Sybyl (trademark of Tripos 
Associates, 1699 S. Hanley Road, St. Louis, MO 63144) 

The probe-accessible surface referred to in step 1 of 
the first method can be generated in several ways 
(Richards, F.M. (1977) Annu. Rev. Biophys. Bioeng. 6:151- 
176): A spherical probe of radius R (0.9 to 2.0 A) is 
allowed to roll on the outside of a molecule while 
maintaining contact with the van der Waal surface. The 
surface defined by, the center of the probe is defined as 
the probe-accessible surface. Alternatively, a similar 
surface can be generated by increasing the van der Waal 
radii of all the atoms in a protein by the radius of the 
probe. Overlapping surfaces are eliminated and the 
remaining surface represents the probe-accessible surface. 
In the preferred embodiment, a three-dimensional box df 
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dimension. 50x50x50 A with a 1 A grid size in all three 
dimensions (x, y, and z) ia centered on the center of mass 
of the target protein coordinate! . Most preferrably, the 
dimensions of the probe map are adjusted such that all of 
5 the protein atoms fall within the probe map's bounds. The 

grid size of 1 A provides a sufficiently high resolution to 
clearly define the probe-accessible surface although 
another grid size could be used, ranging from 0.5 to 3.0 A. 
An uncharged probe molecule is positioned at each grid 
10 point and the energy of interaction between the probe and 
the target protein atoms is determined. The energy of 
nonbonded interaction (B^) contains only the van der Waal 
component such that 

XQOATXCm (1) 
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30 



where r is the nonbonded distance, is the dispersion 

well depth and o Li is the Lennard- Jones diameter. The 
result is a map consisting of a box with energy values at 
each grid point. This map can be contoured at a particular 
energy value to generate surfaces which correspond to the 
20 solvent accessible surface and internal cavities (Goodford, 

P.j. (1985) J. Med. Chem. 28? 49-857). The value at which 
to contour the maps can var; depending on the particular 
radius used and the parameters used to define the probe 
molecule and the particular method used to generate the 
25 probe. The preferred embodiment -s to used a probe radius 

of 0.9 A and contour the surface at 10 )ccal/mol. 

The external surface of the probe-accessible surface 
is also known as the solvent-accessible surface. Probe- 
accessible surfaces inside of the solvent accessible 
surface are defined as internal cavities and represent 
cavities large enough to accommodate a molecule with a 
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radius equal to the probe radius. The presence of such e 
cavity on the inside of a protein does not imply that the 
cavity will in fact be filled by one or more solvent 
molecules. 

5 The second step of the method for identifying sites 

which affect the stability of a protein is the 
identification of the amino acids which form the internal 
cavities. The internal cavities are defined by the amino 
acids which make up its boundaries. These amino acids 

10 comprise a set of positions which, if mutated, may increase 
the stability of the protein. 

In a second method for identifying sites which affect 
the stability of a protein , the first step comprises 
generating a probe-accessible surface by analyzing the 

15 target protein coordinates with an uncharged probe molecule 
having a radius of about 0.9 to about 2.0 A. It is 
important that no water molecules be included in the 
protein structure during this analysis. This step is the 
same as the first step of the method set forth above. 

20 The second step involves aligning the three 

dimensional structure of the target protein and a reference 
protein by moving the three dimensional coordinates of the 
reference protein into the coordinate frame of the target 
protein. The reference protein is usually chosen so that a 

25 high degree of similarity exists between it and the target 
protein so that packing differences between the target and 
reference protein which potentially affect the stability of 
the target protein can be identified. The reference protein 
can be any protein for which a three dimensional structure 

30 is available which is homologous to the target protein. 

Examples of such proetins include but are not limited to 
subtilisin Carlsberg, subtilisin BPN 1 , proteinase K, and 
Thermitase. When the target protein is BIAP, one 

preferred reference protein is Thermitase. Thermitase is an 

35 extra-cellular subti lis in-like serine protease isolated 

from Thermoactinoayces vulgaris (Frommel, c. , et al. (1978) 
Acta Biol. Med. Cer. 37:1193-1204). The protein amino acid 
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sequence of thermitase is 42% identical to BIAP. The high 
degree of similarity between these two proteins provides an 
ideal system with which to examine packing differences that 
affect BIAP stability. In this second step the three 
5 dimensional structures of Thermitase and BIAP are aligned 
using the computer program QUANTA™. The three dimensional 
alignment is carried out by first aligning the primary 
sequences of the two proteins to determine which amino 
acids are equivalent. This is accomplished using FASTA 

10 (Myers, E.W. , and Miller, W. (1988) Comput. Applic. Biosci. 

4:11-17; Pearson, W.R., and Lipman, D.J. (1988) Proc. Natl. 
Acad. Sci. USA 85:2444-2448). Based on this alignment of 
the primary sequence, residues are matched for subsequent 
alignment of the three dimensional structures using HULTLSQ 

15 (sutcliffe, M.J., et al. (1987) Protein Eng. 1:377-384; 

Kabsch, W. (1976) Acta Cryst. A32:922-923) . This program 
uses one structure as fixed coordinates (the target protein 
coordinates) and then rotates and translates a second 
structure (the reference protein coordinates) so as to give 

20 the smallest root mean squared (r.m.s.) deviation between 
the two sets of three dimensional coordinates. For 
example, the alignment of the BIAP and thermitase three 
dimensional coordinates results in an r.m.s. deviation 
between equivalent o-carbons of 0.8 A. This demonstrates 

25 that the amino acid sequences of BIAP and thermitase fold 

into three dimensional structures which are extremely 
similar. 

In the third step, the alignment of the three 
dimensional structures is used to identify sites which 

30 affect the stability of the target protein. This can be 

accomplished by a variety of methods. Using a computer 
program designed to display protein structures and surfaces 
such as QUANTA™, the structure of the reference protein can 
be displayed with the probe-accessible surface. The 

35 combined display of the reference protein and probe- 

accessible surface can then be visually examined to 
determine which amino acids in the reference protein fall 
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outside of the solvent-accessible surface or Inside 
internal cavities. An alternative method which can be used 
comprises coloring the atoms of the reference protein by 
determining whether amino acids in the reference protein 
5 fall outside of the solvent-accessible surface or inside 

internal cavities. The probe-accessible surface map (probe 
nap) was used to color the atoms in the transformed 
subtilisin BPN' structure. In order to color each atom, an 
energy value needs to be interpolated from the probe map at 
10 each atomic coordinate. 

The probe map consists of three dimensional grid with 
an energy value (E) at each grid point. In the preferred 
embodiment, the probe map is a 50x50x50 A box centered on 
the center of mass of the protein with a l A grid unit in 
all three dimensions (x, y, and z) . In its optimal 
conception, the size of the probe map is adjusted such that 
all of the protein atoms fall within the probe map's 
bounds. The energy value at each protein atom position was 
approximated by interpolating from the energy values from 
the surrounded eight grid points in the probe map. Given 
the energy value at each point from the probe map, the grid 
spacing, and the atomic coordinate, it is a simple matter 
for any one skilled in the art to interpolate an energy 
value at each atomic coordinate. 
25 m one such method, an energy value of aero is 

assigned arbitrarily if an atom falls outside the bounds of 
the map. Prom a given atomic coordinate <x,y f z), the eight 
closest grid points from the probe map which surround 
(x,y,z) are identified such that (x A < x < x 2 ) , 
30 (y[ < y < y 2 )r and {z x < z < z 2 ) . The eight grid points are 

then A <x 1# y a , z x ) . B <x a , y 1( z 2 ) , C <x 2 , y 2 , z 2 ) , 

D <X a , y 2 , * < X 2' V l' 2 1>' P y »» 2 * } ' 

6 <x 2 , y 2 , z 2 ), and H (x 2 , y 2 , « a ) . The energy value (E) at 
a given grid point such as <x a , y lt Zj) is then Ets,*,.*,) or 
35 equivalents E x . The energy at a specific atomic coordinate 

E , fI) cm be interpolated from the probe map given the 
eight nearest surrounding grid points (A through H, 
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cavities; (2) identifying the amino acids which make up the 
boundaries of the internal cavities, wherein .aid amino 
acids comprise a set of sites which when mutated increase 
the stability of the protein; (3) identifying an amino acid 

5 mutation which would decrease the volume of said internal 
cavities; (4) determining if said amino acid in said target 
protein can be changed without creating unacceptable steric 
interactions; (5) replacing the amino acid in said target 
protein by site-directed mutagenesis of the gene which 

10 expresses said target protein. 

The first two steps of the above first method for 
improving the stability of a protein are the same as those 
disclosed above for the first computer based method for 
identifying the sites which affect the stability cf a 

15 protein. k . 

in step (3) an amino acid identified in step (2) is 
examined with the goal of identifying a mutation which 
would decrease the volume of said internal cavity. The 
size, shape and position of said internal cavity often 
20 defines and limits what mutations are acceptable and 

allowable given the distinct shape and size of each 
individual amino acid side chain. However, as a particular 
site in the protein has been identified for mutation, 
appropriate mutations can be also be determined by applying 
25 any of the various heuristics which define generally 
acceptable mutations (Matthews, B.W., et al. (1987) Proc 
Natl. Acad. Sci. 84:6663-6667; Henendez-Arias, L., and 
Argos, P. (1990) J. Mol. Biol. 206:397-406; Sandberg, W.S., 
and Terwilliger, T.C. (1991) Trends Biotecanol. 9:59-63; 
30 Bordo, D., and Argos, P. (1991, J. Mol. Biol. 217:721-729). 

in step <4> a determination is then made if the amino 
acid identified for change in the target protein can be 
mutated or changed without creating a conformation of the 
target protein having unacceptable steric interactions. The 
35 separation distance between two atoms considered 

unacceptably short is some percentage of the sum of the van 
der Waal radii of the two atoms in question. Values of 90- 
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95% of the sum of the van der Waal radii are common though 
others could be used. Common atoms between the original 
and replacement amino acid side chain are located and fixed 
in the same position. The new amino acid is rotated to 
5 find the position with the least number of close contacts 

or unacceptable steric interactions (distances shorter than 
physically reasonable) . The separation distance at which 
two atoms are considered unreasonably short is some 
percentage of the sum of the van der Waal radii of the two 
110 atoms in question. Values of 90-95% of the sum of the van 

der Waal radii are common though others could be used. If 
all conformations of the new amino acid have close 
contacts, the amino acid substitution is rejected. A 
conformation with no close contacts which can be matched to 

15 a preferred amino acid conformation as defined by Ponder, 

J.W., and Richards, F.H. (1987) J. Mol. Biol. 193:775-791, 
is most highly desirable. In step (6) the amino acid 
identified for change to the corresponding amino acid in 
the same position in the reference protein is changed by 

20 site-directed mutagenesis of the gene which expresses the 

target protein by the methods disclosed above. 

The second method comprises the steps of: 
(1) generating a probe-accessible surface of said target 
protein by probing the three dimensional coordinates of 

25 said protein with an uncharged probe molecule having a 

radius of about 0.9 to about 2.0 A, wherein said probe- 
accessible surface has an external surface the interior of 
which contains one or more probe-accessible internal 
cavities; (2) aligning said three dimensional coordinates 

30 of said target protein and a reference protein by moving 
the three dimensional coordinates of said reference protein 
into the coordinate frame of said target protein; 
(3) identifying an amino acid in said reference protein 
whose side chain lies outside said solvent-accessible 

35 surface of said protein or inside said internal cavities of 

said target protein; (4) identifying the amino acid in said 
target protein which occupies the equivalent position as 
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said amino acid in said reference protein; (5) determining 
if said amino acid in said target protein can be changed 
without creating unacceptable eteric effects; (6) replacing 
the amino acid in said target protein with the 
5 corresponding amino acid in the equivalent position in said 

reference protein by site-directed mutagenesis of the gene 
which expresses said target protein. 

The first three steps of this method are the same as 
steps (1), (2), and (3) of the second method for the second 

10 computer based method for identifying the sites which 

affect the stability of a protein. 

In step (4) the amino acid in the target protein which 
occupies the equivalent position as the amino acid in the 
reference protein is identified. Equivalency is determined 

15 from the primary sequence alignment and three dimensional 

structure alignment described above. Given two protein 
structures, a target and a reference structure, which have 
been aligned, equivalent amino acids are defined as pairs 
of amino acids, one from the target and one from the 

20 reference protein, which may differ in identity but occupy 

close to the same position in the secondary and tertiary 
structure of the two proteins. 

The following examples are meant to illustrate but not 
to limit the invention. 
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Example 1 

Identification of Bites in BLAP for mutagenesis 

The structure of BIAP was obtained by X-ray 
crystallography and solved to 1.4 k. The atomic coordinates 
5 are shown in Figure 1. Water molecules vere removed from 
the structure and the protein coordinates vere used to 
generate a probe-accessible surface using a computer 
program QUANTA™ (version 3.0) . This program can be used to 
calculate a probe-interaction map. The coordinates of BLAP 
10 vere read into the computer and the following parameters 
vere set in order to perform the probe interaction grid 
calculation. A Van der Waal calculation was requested with 
a "proton" probe (radius of 0.9 A) with a charge of 0.0. 
The box dimensions were set to 50 A with a grid sire of 
1 A centered on the a-carbon of residue 219. The maximum 
energy was set to 500 and the minimum to -100. This means 
that energy values which exceed 500 will be set to 500. An 
energy value will exceed 500 when the probe is very close 
to an atom in the protein. The calculations vere performed 
on a Silicon Graphics Inc. (2105 Landings Drive, Suite 
2105 r Mountain View, CA 94043) 4D/220 Powerlris™ 
workstation. QUANTA™ was used to visualize the probe- 
accessible surface. The nap was contoured at 50 Xcal/mol 
but this value depends on the particular constants in use 
and the method used to generate the probe accessible 
surface. The map was displayed simultaneously with the 
structure of BLAP and amino acid side chains which defined 
the boundaries of the internal cavities were identified 
visually. 

One such amino acid was threonine- 6 9 . This side chain 
is completely buried with only 2% of its surface being 
solvent accessible. The hydroxy 1 group of the side chain 
defined part of the border of two internal cavities. These 
particular cavities are occupied by water molecules 278 on 
one side, and 280 on the other. Mutating this amino acid 
to valine represents a conservative change which increases 
the hydrophobicity of the side chain while having little 
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effect on size and shape. Using computer modeling, it vas 
determined that mutating threonlne-69 to valine would not 
create any close contacts with other protein atoms or 
significantly perturb the structure if the valine occupies 
5 the same position as the hydroxy 1 of tbreonine-69 in the 
wild type protein. An oligonucleotide vas synthesized 
which carried a mutation of the codon for threonine-69 to 
valine (T69V) • This oligonucleotide vas used to create a 
site directed mutation in the BLAP gene which vas subcloned 
10 into a Bacillus vector and expressed in £. subtilis DB104 

(See Examples 4 and 5). Strains were identified which were 
expressing the mutant protease and several shake flasks 
were prepared to produce the mutant protein (See Example 
5) . The mutant protease vas purified from the shake flask 
15 media and characterized for surfactant and temperature 
stability (See Examples 7, 10, and 11) . 

The mutation T69V resulted in a 340% increase in the 
half -life of the protease at 50°C, from 12 minutes to 41 
minutes (See Table 3). 
20 Example 2 

Identification of Sites in BLAP for Mutagenesis 
based on other proteases. 
(A) Comparison to subtilisin carlsberg. 

The three dimensional coordinates of subtilisin 
25 Carlsberg (1CSE) vere obtained from the Brookhaven Protein 
Database (Bernstein, P.C, et al. (1977) J. Mol, Biol. 
112:535-542). The protease structures vere aligned using 
the molecular modeling program QUANTA™. The BLAP 
coordinates vere held fixed. The a -carbons of residues 1 
30 to 32 of BLAP vere matched to residues 1 to 32 of 1CSE, 

respectively; residues 40 to 60 of BLAP to residues 41 to 
61 of 1CSE; residues 80 to 155 of BLAP to residues 82 to 
157 of 1CSE; residues 170 to 269 of BLAP to residues 176 to 
275 of 1CSE. The BLAP structure vas held fixed, and the 
35 1CSE structure va6 rotated and translated such that the 

r.m.s. deviation betveen the a-carbons of matched residues 
vas minimized. The translation vector 
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(-10.68738, 31.28904, -5.32134) and the rotation matrix 
( 0.17406 -0.65535 0.73500 
-0.42119 -0.72422 -0.54599 
0.89011 -0.21454 -0.40209 ) 
were applied to the coordinates of ICSE and the transformed 
coordinates were saved (henceforth, the transformed ICSE 
structure) . The final r.m.s. deviation between the matched 
229 a-carbon pairs was 0.872 A. 

The probe-accessible surface nap calculated in 
Example 1 was used to color the atoms in the transformed 
ICSE structure. The entire map, which consists of three 
dimensional grid of (x, y, s) coordinates in space and an 
energy value at each position, was read into computer 
memory along with the protein coordinates (the transformed 
ICSE structure) . The energy value at each atom position 
was approximated by interpolating from the energy values of 
the surrounding eight nearest grid points in the probe map. 
The protein atoms were colored on the basis of this 
interpolated energy value. The protein was displayed using 
QUANTA™ and atoms were displayed in different colors 
depending on their interpolated energy value. For example, 
if the energy were greater than 400 the atoms were dark 
blue; between 300 and 400, light blue; 200 and 300, green; 
200 to 100 yellow; and between -100 and 100, red. Visual 
inspection of such a display allowed identification of side 
chains which penetrated the solvent accessible surface or 

internal cavities. 

One such amino acid was methionine- 19 9 (ICSE 
numbering) in subtilisin carlsberg. The amino acid was 
identified by visual inspection of the transformed ICSE 
structure (as described above) . Below, the coordinates of 
residue 199 from the transformed ICSE structure are shown 
in the Brookhaven Protein Data Bank file format along with 
the interpolated energy values. 
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Coordinates of Methionine- Iff 
fre» the 1.2 A structure of subtilisin Carlsberg. 

22.392 40.705 32.311 1.0 500.00 

21.675 40.561 31.0S4 1.0 500.00 

22.436 39.677 30.103 1.0 500.00 

23.689 39.601 30.254 1.0 500.00 

21.621 41.991 30.511 1.0 500.00 

20.868 42.994 31.426 1.0 500.00 

lO ATOM 1370 SD MET 1*9 19.150 42.631 31.891 1.0 211.58 

ATOM 1371 CE MKT 199 18.273 43.395 30.493 1.0 41.68 

Column 1 is the record type; column 2 is the atom number; 
column 3 is the atom name; column 4 is the residue name; 

15 column 5 is the residue number; columns 6, 7 * 8 are the 
x, y, z coordinates of the atom, respectively; column 9 is 
the occupancy; column 10 is normally the temperature factor 
but this has been replaced with the interpolated energy 
value. Note that a value of 500 in this column means that 

20 the atom in nearly completely within the van der Waal 
surface of the BIAP molecule. When the probe map was 
calculated (see Example 1) , energy values greater than 500 
were set to 500. As can be seen, atoms 1370 and 1371 have 
significantly lower energy values (column 10) . The end of 

25 this methionine residue extends into an internal cavity in 
the BIAP molecule. 

This residue is equivalent in secondary and tertiary 
structure to valine-193 in BIAP. Using computer modeling, 
valine-193 in BIAP was changed to methionine. The % values 

30 for the new methionine side chain in BIAP were taken from 
the subtilisin BPN' structure. In this conformation, the 
new side chain had no close contacts except for the e- 
carbon of the methionine which contacted a crystal lographic 
water in the BIAP structure. 

35 An oligonucleotide was synthesized which mutated the 

codon for valine-193 to methionine (V193M) in the BIAP 
gene. This oligonucleotide was used to create a site 
directed mutation in the BIAP gene which was subcloned into 
a Bacillus vector and expressed in B. subtilis DB104 (See 

40 Examples 3, 4, and 5) . Strains were identified which were 
expressing the mutant protease and several shake flasks 
were prepared to produce the mutant protein (See Example 
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5). The .utarit protease was purified from the shake flask 
B edia and characterized for teaperature and surfactant 
stability (See Examples 6, 7, 10, and 11). 

The nutation V193M resulted in a 350% increase in the 
5 half-life of the protease at 50-C, from 12 minutes to 42 

minutes (See Table 3). 
CB) Comparison to Thermitase. 

The three-dimensional coordinates of thermitase (1TEC) 
were obtained from the Brookhaven Protein Database 
10 (Bernstein, F.C., et al. (1977) J. Mdl. Biol. 112:535-542). 

The structures of BIAP and 1TEC were aligned using the 
molecular modeling program QUANTA™ by matching equivalent 

a-carbons as listed below. 

Matched a-carbons between 
BIAP and Thermitase (1TBC) 

5-20 12-27 

23-34 29-41 

43-72 52-81 

75-227 85-237 

20 232-256 240-264 

The BIAP structure was held fixed and the 1TEC structure 
was rotated and translated such that the r.m.s. deviation 
25 between the e-carbons of matched residues was minimized. 

The translation vector (14.92521, 33.43270, 40.92134) and 

the rotation matrix 

( 0.79048 -0.20395 -0.57753 
-0.01688 0.93532 -0.35340 
0.61225 0.28911 0.73591 ) 
were applied to the coordinates of 1TEC and the transformed 
coordinates were saved (henceforth, the transformed 1TEC 
structure) . The final r.m.s. deviation between the matched 
236 o-carbon pairs was 1.384 A. 
35 The probe-accessible surface map was used to color the 

atoms in the transformed 1TEC structure. The entire probe 
nap was read into computer memory along with the 
coordinates of the transformed 1TEC structure. The energy, 
value at each atomic position was interpolated from the 
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energy values of the eight surrounding grid points in the 
probe nap. The protein was displayed using QUANTA™ and 
atoms were displayed in different colors as a function of 
their interpolated energy value. For example, if the 
5 energy were greater than 400 the atoms were dark blue; 

between 300 and 400, light blue; 200 and 300, green; 200 to 
100 yellow; and between -100 and 100, red. Visual 
inspection of such a display allowed identification of side 
chains which penetrated the solvent accessible surface or 
10 internal cavities. 

One such amino acid was tyrosine-149 (1TEC numbering) 
in thermitase. The amino acid was identified by visual 
inspection of the transformed 1TEC structure. Below, the 
coordinates of residue 149 from the transformed 1TEC 
15 structure are shown in the Brookhaven Protein Data Bank 

file format along with the interpolated energy values. 

Coordinates of Tyrosine-149 
from the 2.0 A structure of Thermitase. 

ATOM 10S2 H TYR 149 19.783 23.026 47.326 1.0 500.00 

20 ATOM 10S3 CA TYR 149 20.372 21.668 47.275 1.0 500.00 

ATOM 1054 C TYR 149 21.456 21.557 46.165 1.0 500.00 

ATOM 1055 O TYR 149 22.619 21.330 46.486 1.0 500.00 

ATOM 1056 CB TYR 149 19.282 20.595 47.169 1.0 500.00 

ATOM 1057 OS TYR 149 19.8S9 19.183 46.935 1.0 227.30 

25 ATOM 1058 CD1 TYR 149 20.262 18.427 48.038 1.0 79.13 

ATOM 1059 C02 TYR 149 20.014 18.722 45.608 1.0 275.01 

ATOM 1060 CE1 TYR 149 20.762 17.146 47.807 1.0 10.99 

ATOM 1061 CE2 TYR 149 20.531 17.425 45.371 1.0 500.00 

ATOM 1062 « TYR 149 20.860 16.649 46.488 1.0 131.28 

30 ATOM 1063 OH TYR 149 21.165 15.337 46.282 1.0 Si .29 

Column 10 is normally the temperature factor but this has 
been replaced with the interpolated energy value. As can 
be seen, the phenyl ring of the tyrosine side chain has 
significantly lower energy values (column 10 of atoms CG, 
CD1, CD2, CE1, CE2 and CZ) . 

This residue is eguivalent in secondary and tertiary 
structure to serine-139 in BIAP. Using computer modeling, 
serine-139 in BIAP was changed to tyrosine. The % values 
40 for the new tyrosine side chain in BIAP were taken from the 

thermitase structure. In this conformation, the new side 
chain had no close contacts that could not be alleviated by 
small changes (less than 5°) of the % values. The modeled 
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, ' «hain in BL*P f iti neatly into a crevice on 

tyr osin. .ide chain in ^ 

the surface or we * 

"'""n'oligonucleotid. was synthesize which mutated th. 

? «Line-139 to tyrosine (S139Y) in th. B1AP gen.. 
5 codon for serine ,139 J , ^ dir . =ted 

This on^onucXeotx^s us ^ ^ , 

amtation in the *™*J subtlli , OB104 (Se. 

Bacillus vector and e^ressed^ ^ ^ ^ ^ 

Examples ' < ' and s . veral shake fl.sfcs were 

10 eXPr ^: to p^uce "the mutant protein ' (See Example 5, . 

prepared to P« to. the sha*. f las* 

^r^d Xr" S teri«d 'for temperature and surfactant 
stability (See Examples 6, 7, 10, and 11) . 
1S "" Station S139Y resulted in a 21« increase in the 
half-life of the protease at 50-C. fro. 12 mxnute, to « 
minutes (See Table 3). 

Example 3 

Sit. Directed Mutagenesis of th. BLAT gene 
This mutagenesis procedur. was first described by 
20 „ . t al. (1989) Huel.ie Acids Res. 

"™; 5< KhU. this is the preferred ..thod -any 
17.4441 * introduce oligonucleotide 

other methods could be us which use 

•^♦---directed mutations, particularly wo 
site-directea example, the method of KunXel 

h " ^^"''oligonucleotide was synthesis which 
states L codon of threonine-., to the codon for ^ valine^ 

30 Th . mutagenic Z ^ » ■ — " 

duplex DMA which carries tne auu- ? 

single stranded (ss) DHA. The gapped duplex (,d) was 
forced by denaturing linear DNA's fro. P »cl3C ****** 
flowed by re-annealing. Th. mutagenic oligonucleotid. 
35 annealed to homologous s. BLAP DHA within th. gap and the 

35 anne. fined in by a DMA polymerase and llgated 

— treatment of th. product 
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of the fill-in reaction with ExoIU removed any species 
with only partially filled gaps. 

The product of the fill-in reaction was then 
transformed into a repair deficient E. coll strain such as 
5 WK6mutS. Plasmid DNA from the recombinant E. coii WK6mutS 
was prepared and transformed in a low plasmid/recipient 
ratio into a suppressor deficient strain of E. coli such as 
WK6. Ampicillin resistant transf ormants were selected and 
plasmid DNA of several candidates was purified and checked 

10 for the presence of the mutation. 

The mutant BLAP protease was expressed by transferring 
the mutant BLAP genes from their particular E. coli pMal3C 
derivative vector into a plasmid vector which can replicate 
in Bacillus such as pH70 or pC51. In the following 

15 example, the plasmids pC51 and pH70 can be used 

interchangeably with the exception that plasmid pH70 
encodes resistance to kanamycin while plasmid pC51 encodes 
resistance to tetracycline. The mutant BLAP gene was 
separated from the pMal3C plasmids by digestion with the 

20 restriction endonucleases Aval and SstI and then ligated 

with an Aval-SstI cut fragment of plasmid pH70 that 
includes the regions necessary for kanamycin resistance and 
for replication in Bacillus. The pH70 Aval-SstI fragment 
was pur if ied by high pressure liquid chromatography (HPLCJ . 

25 After ligation the mutant BLAP plasmids were transformed 

into B. subtilis DB104, a strain that has been engineered 
to inactivate its own genes encoding the major alkaline and 
neutral proteases. B. suJbtilis DB104 transformed by these 
plasmids were grown on a nutrient-skim milk agar in the 

30 presence of the antibiotic kanamycin. Clones that 
manufactured mutant protease were identified by the 
formation of clear zones of hydrolysis in the skim milk. 
Plasmid DNA was purified from these clones to verify that 
the protease-producing clones carried the a plasmid-borne 

35 BLAP gene with the desired mutation. The plasmid DNA was 

purified away from cell protein and chromosomal DNA by SDS- 
salt precipitation followed by chromatography over a Qiagen 
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lon-.xch.ng. column (Ql^n Corporation), 
dige.ted pl.»ld DNA. tr> different clone. were co.par«d 
with Avat/Sstl-digested d.riv.tiv.. of pl.eaid P H70 knovn 
to carry an intact BLAP gene. PUnid di,e.t. were =°»P«« d 

5 by agarose g.l electrophoresi. to identity pla.»id. . that 

have the proper-.i«d Aval/SstI DNA fragment.. Selected 
planid DMAs were then sequenced aero., the region of the 
particular BLAP station to confir. that the mutagen va. 
present. On. or more done, of each BLAP station were 
stored frozen in 15% glycerol at -70-C and also cultivated 
in shake flask. (Example. « and 5) to -anuf.cture mutant 
protease for characterization. 

Exaaple 4 
Productioa of Protea.e. 

1S Each strain of B. suttilis DB104 that carried a 

plasmid with one of the mutant BLAP genes was cultivated xn 
shaKe flasks to make the mutant protease Strains were 
grown in 50 ml precultures of (Difco) Luna Broth (LB) with 
the antibiotic kanamycin for P H70 derived clones or 
tetracycline for P C5l derived clones at 37-C and 280 rpm xn 
a New Brunswick Series 25 Incubator Shaker. After 7 to 8 
hours of incubation 2.5 or 5.0 ml of the Preculture was 
transferred to 50 or 100 ml of MLBSF medium (Table 5) , 
respectively, with either 20 „g/»l of kanamycin, or 

25 15 „g/ml of tetracycline in 500 ml (Bellco) baffled sha*e 

flasks for growth and eventual production of the protease. 
These main shake flask cultures were incubated at 240 rpm 
and 37 -C for 64 hours before the culture broths were 
treated to remove intact cells and cellular debris, and to 

30 reduce the pH to 5.8 before they were concentrated. The 

protease production of each culture was monitored by 
electrophoresis of culture supernatants with reverse 
polarity on 12.5% homogenous polyacrylamide gels with the 
Pharmacia Phastsystem. 
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A hot loop was used to streak each mutant strain from 
a frozen cryovial culture onto an LB-skim milk agar 
containing either 20 ng/ml of kanamycin or 15 jig/ml of 
tetracycline. The plates were incubated at 37 # C for 20 to 
5 24 hours. A single, isolated colony producing a good zone 
of hydrolysis of the skin milk was picked into a 250 ml 
Erlenmeyer flask containing about 50 ml Luria Broth (LB) 
which contained either 20 pg/ml kanamycin or 15 jig/ml of 
tetracycline. The broth was incubated in a New Brunswick 

10 Series 25 Incubator Shaker at 37 «C with shaking at 280 rpm 
for 7 to 8 hours. Either 2.5 ml of the turbid preculture 
was transferred into 50 ml of MLBSP containing either 
20 M9/ml kanamycin or 15 ng/nl of tetracycline in each of 
four baffled 500 ml flasks, or 5 ml of preculture was used 

15 as an inoculum for 100 ml of MLBSP broth with antibiotic 

contained in each of two 500 ml baffled flasks (a 5% v/v 
transfer). All flasks were incubated at 240 rpm and 37°C 
for 64 hours. After 64 hours of incubation the set of 
flasks for each culture was consolidated, transferred to 50 

20 ml centrifuge tubes, and centrifuged at 20,000 g 4T for 15 
minutes at 4»C. The broth was filtered through Miracloth 
(Calbiochem Corp. Z475855) into 400 ml beakers chilled on 
ice. The broth was slowly stirred on ice for 30 minutes 
before the broth pH was reduced to 5.8 by the slow addition 

25 of glacial acetic acid. More fine debris were removed by 

centrifugation again at 20,000 g av and the broth was 
filtered through Miracloth into graduated cylinders to 
measure the volume. Two sets of 1 ml samples were made for 
PhastSystem gels and activity assays. The broth was stored 

3 0 on ice until the protease could be purified. The MLBSP 
media used for the production of B1AP in shake flask 
cultures is described in Table 5. 

35 
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TABLE 5 
COMPOSITION OF MLBSP MEDIUM 

Quantity (for 1 liter of media) 



deionized water 
Difco Casitone 
Difco Tryptone 
Difco Yeast Extract 
MaCl 

Sodium Succinate 



750 


ml 


10 


cpn 


20 


gin 


10 


gin 


5 


gin 


27 


gin 


Of 


7. 



SSSTtS ^ 1 a .% 1 tU b s , ci"tiorae S =rib.d in Appendix 1. 
while stirring. 

APPENDIX 1 (additions to MLBSP broth) 



Component 



Quantity 
(for 1 L of media) 



40 



MgS0 4 . 7H 2 0 
CaCl 2 .2H 2 0 
FeS0 4 .7H 2 0 
MnCl 2 . 4H 2 0 
Glucose 
PIPES Buffer 1 
Buffer 2 



(100 mg/ml stock, autoclaved) 
(30 mg/ml stock, autoclaved) 
(lmM stock, filter sterilized) 
(lmM stock, autoclaved) 
(25% (w/v) stock, autoclaved) 
( P H 7.2, 1 M stock, autoclaved) 50.0 ml 
(1.5 M stock, autoclaved) 50.0 ml 



1.0 ml 
2.5 ml 
0.5 ml 
0.5 ml 
80.0 ml 



45 



50 



* *ipera*ine-H,N'-bi«<2-ethane aulfonic acid) 



200 ml of l.S K monobasic f^f "P h *^,^^*/ t h "."iickian colnbi-nation 

Either kanamycin or tetracycline antibiotic stock solutions 
were added to the media just before use to a final 
concentration of 20 *g/nl and 15 Mg/»1 respectively. 
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Example < 

Purification of BLXP 

Fermentation broth of transformed B. subtilis DB104, 
while still in the fermenter, was adjusted to pH 5.8 with 
5 4 N H 2 S0 4 . The broth was collected and cooled to 4 # C. If 

not mentioned otherwise, all subsequent steps were 
performed on ice or at 4*C. An aliquot of the broth 
material was clarified by centrif ugation at 15 , 000 x g av . 
for 60 min. Floating lipid material was removed by 

10 aspiration, and the supernatant filtered through Miracloth. 

The dark brown solution was placed in dialysis tubing 
(Spectrapor; #1, 6 to 8 kilodalton (kDa) molecular-weight- 
cut-off, 1.7 ml/cm) and dialyzed for 16 hours in 20 mM 2- 
(N-morpholino)ethanesulfonic acid (MES) containing 1 mM 

15 CaCl 2 , adjusted with NaOH to pH 5.8 ( 'MES buffer*). The 

dialysate was clarified by centrif ugation (20,000 x g av< for 
10 min) and the pH of the solution was adjusted to 7.8 with 
2 N NaOH. The enzyme solution containing approximately 
0.9 g of protein in 1.2 liter was loaded at a flow rate of 

20 150 ml/hour onto a column of S-Sepharose Fast Flow (SSFF, 

Pharmacia; 25 mm diameter, 260 mm long) previously 
equilibrated with 20 mM N-(2-hydroxyethyl)piperazine-N , -(2- 
ethanesulf onic acid) [HEPES], containing 1 mM CaCl 2 , 
adjusted with NaOH to pH 7.8 (•HEPES buffer 1 ). After the 

25 application of the enzyme solution the column was washed 

with 2 column volumes (250 ml) of HEPES buffer and then 
developed at a flow rate of 140 ml/hour with a gradient of 
0 to 0.25 M NaCl in 600 ml of HEPES buffer. The gradient 
eluate was fractionated into 5 . 2 -ml aliquots which were 

30 collected into tubes containing 2 ml of 100 mM MES/Na*, 

pH 5.8. The enzyme eluted between 0.12 and 0.15 M NaCl, 
Fractions containing the enzyme were pooled and protein was 
precipitated with ammonium sulfate at 52% of saturation. 
Solid salt (0.33 g per ml of solution) was added slowly 

35 with stirring over a period of 15 min, and stirring was 
continued for another 15 min. The precipitate was 
collected by centrif ugation, the pellet was dissolved in 
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HES buffer and the protein concentration in the solution 
was adjusted to 5 to 7 mg/ml. Following dialysis for 
16 hours in KES buffer the solution was clarified by 
centrifugation and the pH of the supernatant was adjusted 
5 to 7.2. The protease was purified further by a second 
cation exchange separation on SSFF. All steps of this 
procedure were the same as above except that the pH of the 
HEPES buffer was 7.2 and that the NaCl gradient was from 0 
to 0.25 M in 600 ml of HEPES buffer. Protein in pooled 

10 fractions was precipitated as above with ammonium sulfate 
and the enzyme was stored as ammonium sulfate precipitate 
at -70°C. Prior to use the ammonium sulfate precipitate of 
the enzyme was dissolved in an appropriate buffer, 
typically KES buffer, at the desired protein concentration, 

15 and dialyzed overnight in the buffer of choice. 

Example 7 
Purification of BLXP Mutants 
Fermentation broth from shaJce flasks, on average 
180 ml, was collected and clarified by centrifugation at 

20 20,000 x g av . for 15 min. The supernatant was placed, with 
stirring, on ice and after 30 min the pH of the solution 
was adjusted to 5.8 with glacial acetic acid. If not 
mentioned otherwise, all subsequent steps were performed on 
ice or at A*C. The solution was clarified again by 

25 centrifugation (20,000 x g av . for 15 min) and was 
concentrated approximately 4-fold by ultrafiltration 
(Amicon; YM30 membrane). The dark brown solution was 
placed in dialysis tubing (Spectrapor; #1, 6 to 8 KDa 
molecular-weight-cut-off, 1.7 ml /cm) and dialyzed for 

30 16 hours in 20 mM HEPES/Na*, pH 7.8, containing 1 mM CaCl 2 
(•HEPES buffer 1 )- The dialysate was clarified by 
centrifugation (20,000 x g av . for 10 min) and the pH of the 
solution, if necessary, was adjusted to 7.8 with 2 N NaOH. 
The enzyme solution was loaded at a flow rate of 60 ml/hour 

35 onto a column of SSFF (15 mm diameter, 75 mm long) , 
previously equilibrated with HEPES buffer. When all 
colored by-products were eluted, the column was washed with 
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50 ml of HEPES buffer. Then, the enzyme was eluted with 
0.25 M NaCl in HEPES buffer. Fractions of 1.2 ml were 
collected into tubes containing 0.5 nl of 100 mM KES/Na*, 
pH 5.8. Protein content in fractions was Monitored either 
by a uv detector set at 280 na or by protein assay as 
described below. Pooled fractions containing protease 
protein were placed on ice and protein vas precipitated 
with a 5 to 8-fold volume excess of acetone at -20»c. The 
protein was allowed to precipitate for 6 min, the mixture 
was centrifuged for 4 min at 6,600 x g av , the supernatant 
was discarded, the pellet was briefly exposed to vacuum 
(water aspirator) to remove most of the acetone, and the 
pellet was dissolved in 20 mM MES/Na + , pH 5.8, to give an 
approximate protein concentration of 30 mg/ml. Prior to 
any assays, the solution was centrifuged in an Eppendorf 
centrifuge for 3 min at full speed (13,000 x g max ). 

Example 8 
Protein Determination 
Protein was determined by a modified biuret method 
(Gornall, A.G., et al. (1948) J. Biol. Che*. 177:751-766). 
The protein in a total volume of 500 nl vas mixed with 
500 Ml of biuret reagent and incubated for 10 min at 50°C. 
The solution was briefly chilled and its absorbance was 
measured at 540 nm. Typically, a reagent blank and three 
different protein aliguots in duplicates were measured and 
the recorded optical densities analyzed by linear 
regression. Bovine serum albumin (BSA, crystalline; 
Calbiochem) was used as protein standard. With purified 
BLAP protein the usefulness of BSA as protein standard in 
the biuret assay was confirmed. A BLAP sample was 
exhaustively dialyzed in 1 mM sodium phosphate, pH 5.8, and 
subsequently lyophilized. A sample of the solid material 
was weighed, dissolved in 1 mM sodium phosphate, pH 5.8, 
and used to generate a standard curve for the biuret assay. 
From the actual difference in phosphate content (Black, 
M.J., and Jones, M.E. (1983) Anal. Biochem. 135:233-238) of 
the final protein solution and the nominally l mM sodium 
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phosphate solution used to dissolve the protein, the 
contribution of phosphate to the weight of solid BLAP was 
estimated and used to correct the standard curve. 

Example 9 
Protease Assays 
Two different protease assays were used* With the HPE 
method protease activity was established at a single 
concentration of casein (prepared according to Hammarsten; 
Merck, #2242) as substrate. In the AAPF-pNA assay initial 
ratesofsuccinyl-L-alanyl-L-alanyl-L-prolyl-L-phenylalanyl- 
p-nitroanilide (AAPF-pNA; Bachem) supported catalysis were 
used to determine the kinetic parameters K., k C8t , and 

^cat/Km* 

A. HPE Method. 

Culture supematants or solutions of purified 
proteases were diluted with chilled buffer (10 mM MES/Na + , 
pH 5.8) to give three different solutions with a protein 
concentration ratio of 1:3:5. The substrate solution 
contained 9.6 mg/ml casein, 24 mM Tris, and 0.4% (w/v) 
sodium tripolyphosphate, dissolved in synthetic tap water 
(STW; 0.029% (w/v) CaCl 2 «2H 2 0, 0.014% (w/v) MgCl 2 »6H 2 0, and 
0.021% (w/v) NaHC0 3 in deionized water) adjusted to pH 8.5 
at 50 *C, prepared as follows. With stirring for 10 min, 
6 g of casein was dissolved in 350 ml of STW. To this, 
50 ml of 0.3 M Tris in STW was added and stirring was 
continued for another 10 min. This solution was heated to 
70 # C, then allowed to cool slowly. At 50*C, the pH was 
adjusted to 8.5 with 0.1 N NaOH. When the solution reached 
room temperature, the volume was adjusted to 500 ml with 
STW, followed by the addition of 125 ml of 2% (w/v) 
pentasodium tripolyphosphate in STW, pH 8.5 (adjusted with 
3 N HC1) . The protease assay was started by adding 50 pi of 
protease solution to 750 jil of substrate solution placed in 
a 2.2 ml Eppendorf container preincubated for 10 min at 
50 *C. After 15 min, the reaction was terminated by the 
addition of 600 /xl of trichloroacetic reagent (0.44 M 
trichloroacetic acid, 0.22 M sodium acetate in 3% (v/v) 
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glacial acetic acid). The mixture was placed on ice for 
15 min, the precipitated protein removed by centrifugation 
for 8 sin (at 13,000 x g mAX „) and a 900 jil aliquot of the 
supernatant was mixed with 600 j*l of 2 N NaOH. The 
absorbance at 290 nm of this solution was recorded. Each 
dilution was assayed in duplicates and the data points for 
three different dilutions from one enzyme sample was 
analyzed by linear regression. A slope of 1 in this assay 
corresponds to 80 HPE units in the least diluted sample. 
In case of strongly colored culture supernatants with 
measurable quantities of UV absorbing material carried over 
by the diluted protease aliquot into the assay cuvette a 
control curve was constructed whose slope was subtracted 
from the slope of the protease assay before final HPE units 
were calculated. 
B. AAPF-pNA Assay 

Protease samples were diluted with 50% (v/v) 
l, 2 -propanediol in 100 mM Tris, adjusted with 2 N HC1 to 
pH 8.6 at 25°C ( •Tris-propanediol buffer") , in which they 
were stable for at least 6 b at room temperature. A stock 
solution of 160 mM AAPF-pNA was prepared in 
dimethylsulf oxide dried with a molecular sieve (Aldrich; 
4 k, 4-8 mesh) for at least 24 h prior to use. Fixed point 
assays were performed at 25°c with 1.6 mM AAPF-pNA in 
100 mM Tris, adjusted with 2 N HC1 to pH 8.6 at 25*C, in a 
total volume of 1.020 ml. The substrate was added to the 
assay buffer 1 min prior to the assay initiation and the 
reaction was started by addition of enzyme at a final 
concentration of 20 ng to 1.3 *ig of protein per ml (0.75 to 
48.5 nM enzyme) depending on specific activity. Release of 
p-nitroanilide was monitored at 410 nm, and a molar 
extinction coefficient of 8,480 M^csT 1 was used to 
calculate amount and concentration of product formed 
(DelMar, E.G. , et al. (1979) Anal. Biochem. 99:316-320). 
Kinetic parameters were calculated from a velocity vs. 
substrate concentration plot constructed from initial rates 
measured once each at 12 different AAPF-pNA concentrations 
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ranging from 0.16 to 3.2 mM. Data were fitted to a 
hyperbolic curve and proportionally weighted using the 
program EN Z FITTER (Leather barrow, R.J. (1987) EN Z FITTER, 
Biosoft, Cambridge, UK) . A nominal molecular weight of 
26.8 kDa was used in all calculations that required the 
interconversion of protein concentration and molarity of 
protease enzyme. 

Example 10 

Temperature Stability of Purified Proteases 

Stability of protease proteins was evaluated under two 
different conditions: (a) 100 mH glycine/Na*, pH 10 at 60 °C, 
and (b) 100 mM glycine/Na*, pH 11 at 50 »C. At t - 0 min, 
the protein was diluted to approximately 0.25 mg/ml into 
incubation buffer maintained at the desired temperature. 
Periodically, an aliquot was removed from this incubation 
mixture and diluted into Tris-propanediol buffer chilled on 
ice. Residual protease activity was determined by the 
AAPF-pNA assay at a fixed AAPF-pNA concentration (1.6 mM) . 
Stability is expressed as half -life (t 1/2 ) of activity 
determined from semi-logarithmic plots of residual activity 
as function of time. Each plot consisted of 6 data points 
with t 1/2 approximately in the center between experimental 
points. 

example 11 

Resistance of Proteases to Sodium Dodeeylsulfate (8D8) 

SDS was selected as representative of surfactants in 
general. Resistance of proteases to SDS was evaluated 
under two different conditions: (a) 100 mM Tris adjusted 
with 2 N HC1 to pH 8.6 at 50 # C, containing 1% (w/v) SDS, 
and (b) 50 mM sodium carbonate, pH 10.5 at 50*C, containing 
1* (w/v) SDS. Protease proteins were incubated at a final 
protein concentration of 0.25 mg/ml. Data were collected 
and evaluated as described above under Example 10. 

Example 12 
Polyacrylamide Gel Electrophoresis 
Purity of protease samples was evaluated on 20% non- 
denaturing PhastSystem gels (Pharmacia) run with reversed 
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polarity. The same system was used to monitor the protease 
content of crude shake flask and fermentation broths. 
Buffer strips were prepared as described in Application 
File No. 300 (Pharmacia). 
5 Molecular weight determinations were performed on 20% 

SDS PhastSystem gels, using the following markers: bovine 
serum albumin, 66 kDa; egg albumin, 45 kOa; glyceraldehyde- 
phosphate dehydrogenase, 36 kDa; carbonic anhydrase,- 
29 kDa; trypsinogen, 24 kDa; trypsin inhibitor, 20.1 kDa; 

10 o-lactalbumin, 14.2 kDa (all from Sigma). Prior to 
SDS-PAGE, a protease sample was denatured with formic acid 
at a final concentration of 30 to 50% (v/v) . Upon dilution 
of formic acid to 15% (v/v) protein was precipitated with 
trichloroacetic acid at a final concentration of 10% (v/v) . 

15 The collected pellet was washed once with water, then 

dissolved in 2% (v/v) SDS and heated for 2 min in a boiling 
waterbath. Gels were stained with Coomassie Brilliant Blue 
R-250 (Kodak). 

DEPOSIT OF MICROORGANISMS 

20 Living cultures of the following have been accepted 

for deposit under the Budapest Treaty on the International 
Recognition of the Deposit of Microorganisms for the 
purposes of patent procedure by tbe American Type Culture 
Collection, 12301 Parklawn Drive, Rockville, MD 20852 on 

15 May 8, 1991 (the accession number preceeds each deposit 

description) : ATCC 68614 - Bacillus licheniformis ATCC 
53926 strain which contains a tetracycline-resistance 
plasmid originally derived from Bacillus plasmid pBC16 
which carries the ATCC 53926 alkaline protease-BLAP Clal 

i0 fusion gene, whose structural gene has the mutations S3T, 
V4I, A188P, V193M, V199I; ATCC 68615 - E. coll WK6 which 
carries phasmid pMcl3C, a chloramphenicol-resistant 
derivative of phasmid pMc5-8, that contains the ATCC 53926 
alkaline protease- BLAP Clal fusion gene and a 164 bp Kpnl 

5 fragment carrying the ATCC 53926 alkaline protease gene's 
transcriptional terminator. The genotype of strain WK6 
are Alftc.-pxo.AB, galE, s£r.A, mutS ; ; TnlO/F ' lad*. 2AM15, 
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BtfiA + B + (Zell, R. , and Fritz, H. -J. (1987) EMBO J. 6:1809- 
1815)? ATCC 68616 - E. coii GM33 which carries plasmid 
pCB13C, an aapicillin-resistant derivative of Pharmacia 
plasmid vector pTZ19R (Pharmacia) that contains the ATCC 
5 53926 alkaline protease-Clal fusion gene. The GM33 

strain's genotype is djs3 (dam-methylase minus (Marinus, 
M.G. and Morris, N.R. (1974) J. Mol. Biol. 85:309-322)); 
ATCC 68617 - E. coll WK6 which carries phasmid pMa5-8, an 
ampicillin-resistant mutagenesis vector described in 

10 Stanssens, P. et al. (1989) Nucleic Acids Research 

17:4441-4454. The genotype of strain WK6 nutations are 
Alac-proAB, galE, s£r.A, iutS: :Tnl0/F'lacI*, ZAM15, proA + B* 
(Zell, R. , and Fritz, H. -J. (1987) EMBO J. 6:1809-1815); 
ATCC 68618 - an E. colt WK6 which carries phasaid pMc5-8, 

15 a chloramphenicol-resistant mutagenesis vector described in 

Stanssens, P., et al. (1989) Nucleic Acids Res. 17:4441- 
4454. The genotype of strain WK6 are Alac-proAB, galE, 
strA . ButS^TnlO/F'IacI^, ZAM15 , proA*B + (Zell, R. , and 
Fritz, H. -J. (1987) EMBO J. 6:1809-1815). 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Christiansen, Teresa 
Goddette, Dean W. 
Ladin, Beth F. 
Lau, Maria R. 
Paech, Christian 
Reynolds, Robert B. 
Wilson, Charles R. 
Yang, Shiov-Shong 

(ii) TITLE OF INVENTION: Third Generation Protease Mutants 

(iii) NUMBER OF SEQUENCES: 105 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Henkel Corporation 

(B) STREET: 14 0 Gennantown Pike, Suite 150 

(C) CITY: Plymouth Meeting 

(D) STATE : Pennsylvania 

(E) COUNTRY: USA 

(F) ZIP: 19462 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Drach, John E. 

(B) REGISTRATION NUMBER: 32891 

(C) REFERENCE/DOCKET NUMBER: M4922 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 215-832-2215 

(B) TELEFAX: 215-941-6067 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: S3T, V4I, A188P, V193M, V199I 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Ala Gin Thr He Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 
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Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val cly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Pro Gly Leu Asp He 

180 185 190 

Met Ala Pro Gly Val Asn He Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 



Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: S3T, A188P, V193M, V199I 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Ala Gin Thr Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 
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His Asn Arg Cly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Cly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Glv Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 
1 85 90 95 

Asd Glv Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
V 100 105 110 

Glv Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
J us 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 . 155 160 

Tvr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
* 165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Pro Gly Leu Asp He 
180 185 190 

Met Ala Pro Gly Val Asn He Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 
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Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 

(B) CLONE: V4I, A188P, V193M, V199I 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Ala Gin Ser He Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 
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Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tvr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Pro Gly Leu Asp He 
180 185 190 

Met Ala Pro Gly Val Asn He Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSN 5843 

(Vii) IMMEDIATE SOURCE: 

(B) CLONE: S139Y, A188P, V193M, V199I 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 

5 10 15 



His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 

Thr Gly lie Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 

55 60 



His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Le U 

70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 



no 



Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 

120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Tyr Ala Thr Ser Arg Gly 



140 



Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 

155 



160 



Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 1? £ 
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Asn Asn Asn Arg A!a ser Phe Ser Gin Tyr oly Pro Gly Leu Asp II. 

180 18:> 

Met Ala Pro Gly VI Asn He Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 



260 



(2) INFORMATION FOR SEQ ID HO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine f^otease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) V™^""^ S139Y , A188 p, Vl9 3M. V».I 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
! 5 10 1=> 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala lie Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Thr Ala Thr Leu Glu Gin Ala Val Asn Tyr Ala Thr Ser Arg Gly 

130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Pro Gly Leu Asp lie 
180 185 190 

Met Ala Pro Gly Val Asn lie Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 
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Arg Asn His Leu Lys Asn Thr Ala 

245 



His Asn Arg Gly Leu Thr Gly Ser 
20 



Thr Gly lie Ser Thr His Pro Asp 
35 40 



Phe Val Pro Gly Glu Pro Ser Thr 
50 55 



His Val Ala Gly Thr lie Ala Ala 
€5 70 



Gly Val Ala Pro Ser Ala Glu Leu 

85 



Thr Ser Leu Gly Ser Thr Asn Leu 
250 255 



Pro Ala Ala 
15 



Gly Val Lys Val Ala Val Leu Asp 
25 30 



Leu Asn lie Arg Gly Gly Ala Ser 

45 



Gin Asp Gly Asn Gly His Gly Thr 
60 



Leu Asn Asn Ser lie Gly Val Leu 
75 80 



Tyr Ala Val Lys Val Leu Gly Ala 
90 95 



Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 

260 265 
INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 

(B) CLONE: A188P r V193M, V199I 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala 
l 5 10 
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Asp Gly Arg Cly Ala lie Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Pro Gly Leu Asp lie 
180 185 190 

Met Ala Pro Gly Val Asn lie Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SEHSE: NO 

(vi) ORIGINAL SOURCE: 

( ' (A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSN 5843 

(vii) IMMEDIATE SOURCE: „,„,., 
(B) CLONE: S3T, A188P, V193M 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Ar 
10 



Ala Gin Thr val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 



His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 

20 *° 

Thr Gly lie Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 



His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

J 85 90 

Asp Gly Arg Gly Ala II. Ser Ser II. Ala Gin Gly Leu Glu Trp Ala 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 

115 120 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser * 



145 



Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
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65 

Asn Asn Asn Arg Ala Ser Phe Scr Gin Tyr Gly Pro Gly Leu Asp lie 
180 185 190 

Met Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE : 
(B) CLONE: S157T 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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66 

Th r ciy XX. - T H.S «. ; . ~» - xx. - «r u. - 

35 



His Val Ala . 
65 



tv^ cm asp Gly Asn Gly His Gly Thr 
Phe Val Pro Gly Glu Pro ser Thr Gin Asp y ^ 

50 

Gly mr II. — -» S " " e « 

70 

Gly v.l XI. Ser AX. «» M u Tyr Al. v.l W V.I I*u Gly Ala 

„ TlA l1a cin Gly Leu Glu Trp Ala 
Asp Gly Arc, Gly Ala He S.r Ser lie Al. My lM 

100 

Gly A.n Asn Gly mt His V.I Al. A.n ^ Ser - $ Ser Pro Ser 

115 

XI. Tfcr I- 01. «n Al. val A.n ser Al. T h r s.r Ar, «y 

135 

val val Ala Al. S.r Gly A.„ Ser Gly Al. Thr Ser lie Ser 

150 

»i. Mot Ala Val Gly Ala Thr Asp Gin 
Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala val y i?g 

_ c __ Gln Tyr Gly Ala Gly Leu Asp He 
Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr y w 

«»i n« ser Thr Tyr Pro Gly Ser Thr Tyr 
val Ala Pro Gly Val Asn Val Gin Ser Thr yr ^ 

195 ' 

iu Thr Pro His Val Ala Gly Ala 
Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro ^ 

210 210 

, r v^ro ser Trp Ser Asn Val Gin He 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp 240 

225 230 



Pr ° ??n ~" «5 

130 



Val Leu 
145 X 
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Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: A188P, V193M 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
15 10 IS 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 
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Asp ciy Ar, «y »• s « "J " a C1Y ^ "o 



100 



Gly Asn Asn Gly Het His Va! Ala Asn L*u Ser I*u My Ser Pro Ser 

115 ■ L, * u 



Pro ser Ala Thr Leu Olu Gin Ala Val Asn Ser Ala Thr Ser Ar, Gly 
130 135 

val Leu val val Ala Ala Ser Gly Asn Ser Oly Ala Ser Ser He Ser 

14 5 

Ty r Pro Ala Arg Tyr Ala Asn Ala Met Ala Val cly Ala Thr Asp Gin 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Pro Gly Leu Asp He 

180 183 

Met Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 

5 no 205 
195 200 

Ala ser Leu Asn Gly Thr Ser W Ala Thr Pro His Val Ala Gly Ala 

210 215 



240 



Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 



Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 



(2) INFORMATION FOR SEQ ID NO: 10: 



fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: A188P 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
! 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala lie Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 
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70 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Pro Gly Leu Asp lie 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 

(B) CLONE: S3T, V4I, A188P, V193M 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Ala Gin Thr He Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 

• His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 

20 25 30 
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Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala lie Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Pro Gly Leu Asp lie 
180 185 190 

Met Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 
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Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Sef"Tttr Asn Leu 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

>) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: V193M 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arc Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly lie Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 
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Asp Gly Arg Gly Ala lie Ser Ser He Ala Gin Gly Leu "Glu~Trp"Xia 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Met Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

( (A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S104T 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 



His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 ^ ^ 

Phe val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 

50 55 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 

Asp Gly Arg Gly Ala He Ser Thr lie Ala Gin Gly Leu Glu Trp Ala 

100 105 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly ser Pro Ser 
115 120 125 

Pro ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 



145 



150 155 1«° 



Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 



165 



9221760A1 I > 



SUBSTITUTE SHEET 



WO 92/21760 



75 



PCT/US92/04306 



Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: T69V 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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Thr Gly lie Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Val He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Glv Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 
J 85 90 95 

ASP Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tvr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyx Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 
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AT, ASH Hi. I*u Lys As „ Thr M . Thr ^ Wy ^ ^ ^ 



250 

255 



Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(IV) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 

(B) CLONE: V4I, A188P, V193M 

(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Ala Gin ser He Pro Trp Gly n . Ser ^ Val Glfi ^ ^ ^ ^ 

15 

His As„ Arg „ y Leu Gly Ser Jy w ^ ^ ^ ^ ^ 

3 30 

»r Gly jj. ser ^ 8is Pro ^ ^ ^ ^ 

45 

»>e va! „ y Glu Pro f . r ^ cl „ Asp My ^ ^ ^ ^ ^ 

6 0 

His val Ala Gly Thr He Ai a »i a t 

65 70 AU ^ ASn A * n Ser Gly Val Leu 

75 _ 

80 

«y Ma Pro f . r Ala 01u ^ ^ vsi ^ ^ ^ 

95 



.9221760A1J_> 



SUBSTITUTE SHEET 



WO 92/21760 



78 



PCT/US92/04306 



Asp Gly Arg Gly Ala He Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Pro Gly Leu Asp He 
180 185 190 

Met Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN : Bacillus lentus DSM 5643 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: A224V 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asd 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 J 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trr> Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 12 5 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Ara Glv 
130 135 140 3 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 15° 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 17 5 
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Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr cly Ala Gly Leu Asp He 
180 185 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Val 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
_ _ ^ 230 235 240 



225 



Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 
= - - - 250 255 



245 

Tvr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: V199I 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 



_9221760A1_I_> 



SUBSTITUTE SHEET 



WO 92/21760 81 PCT/US92/04306 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 190 

Val Ala Pro Gly Val Asn He Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 
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Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

1) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: V4I 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Ala Gin Ser He Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 
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Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S3T 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Ala Gin Thr Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Glv Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

* 85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 3.05 110 

'lv Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 

* H5 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tvr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
* 165 170 175 
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Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 

225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S139Y 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 20: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 

* 40 * 3 



35 



Phe val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

* 85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 1 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 m 120 125 

Pro ser Ala Thr Leu Glu Gin Ala Val Asn Tyr Ala Thr Ser Arg Gly 
130 I 35 140 

val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 1 9 ° 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

* 245 250 255 
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Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Aro 
260 265 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: N242A 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 

5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
Thr Gly lie Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 

40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 

=>u 55 g 0 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser He Gly Val Leu 

0 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 J 

Asp Gly Arg Gly Ala He Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 

100 105 no 
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Oly Asn A,n Gly Met Hi. v.1 Al. Asn Leu ser Leu Gly Ser Pro Ser 

115 

Pro ser JO. Thr Leu Glu Gin Al. v.1 Asn Ser Ala Thr Ser Arc, Gly 

130 135 

val Leu val val Ala Ala Ser Gly Asn ser Gly Ala Ser Ser lie Ser 

14 5 

Tyr Pro Ala Arg Tyr Al. Asn Ala Met Al. V.1 Gly Al. Thr Asp Gin 
Asn Asn Asn Arg Ala ser Phe Ser Gin Tyr Gly Al. Gly Leu Asp II. 

val Al. Pro Gly val Asn Val Gin ser Thr Tyr Pro Gly Ser Thr Tyr 

195 200 

Ala ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 



225 



Arg Ala His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 



Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSH 584 3 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S236T 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Ara Glv 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 
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A S n A S n Asn Arg Ala Ser Phe ser Gin Tyr Gly Ala Gly Leu A S p II. 



180 



val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly ser Thr Tyr 

195 200 20:> 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Thr Asn Val Gin lie 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 
* 245 250 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: S36A 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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Thr Gly lie Ala Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asd Glv Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
F 1 100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 
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Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE : 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: H243A 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
! 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
€5 70 75 80 

Glv Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 
* 85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 
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Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 

Arg Asn Ala Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
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(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: A101T 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Glv Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 
3 85 90 95 

Asp Gly Arg Gly Thr He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 



Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
165 170 175 
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Asn Asn Asn Arg 
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Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 
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Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 

185 190 



Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 



Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 



Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: S236A 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 : 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 



His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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Thr Gly lie Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ala Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 
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Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: HO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 584 3 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: E87R 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

Ala Gin ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 



15 



His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Arg Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala He Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 
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Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 584 3 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: N114S 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asd 
20 25 30 

Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 J so 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala lie Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Ser Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Glv 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 
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Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 



Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 



Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 



Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 



Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lent us DSM 5843 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: A47W 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 



His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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Thr Gly lie Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Trp Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Ara Glv 
130 135 140 y 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 1 60 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Glv Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 
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Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

Lii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

Cvi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

'ii) IMMEDIATE SOURCE: 
(B) CLONE: A120S 

k 'xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asd 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 no 
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Gly Asn Asn Gly Met His Val Ser Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Aro Glv 
130 135 140 * 7 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 1 60 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asd He 
180 185 1 90 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Glv Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Aro 
260 265 * 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: T56V 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 



Phe Val Pro Gly Glu Pro Ser Val Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 



His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala lie Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 
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Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 584 3 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: A120V 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
•»r 40 45 



35 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Glv Asn Asn Gly Met His Val Val Asn Leu Ser Leu Gly Ser Pro Ser 
3 115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tvr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
J 165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 
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Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: G205V 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala lie Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 
100 105 110 
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Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 



Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arc Glv 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Val Ser Thr Tvr 
195 200 205 

210 215 Ala Gly Ala 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

Cii) MOLECULE TYPE: protein 

fiii) HYPOTHETICAL: NO 

Civ) ANTI -SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S130A 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 HO 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ala Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Glv 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 
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Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 1S 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Wla Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
?225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S130T 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Thr Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

243 250 
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Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 36: 

(X) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

Jfii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: A96I 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly lie 

85 90 95 

Asp Gly Arg Gly Ala lie Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 

100 105 110 
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Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
H5 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

^ Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
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(vi) ORIGINAL SOURCE: b^.... 

S! ssgsr^s™ est™ 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: S104T, S139Y, A««v 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 



lis Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 " 

Thr Gly lie ser Thr His Pro Asp Leu Asn II. Ar g Sly Gly Al. ser 

Phe val Pro Gly Glu Pro ser Thr Gin Asp Gly Asn Gly His Gly Thr 

50 55 



Hi. val Ala Gly Thr II. Al. Al. Leu Asn Asn Ser He Gly V.l Leu 

65 70 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 

Asp Gly Arg Gly Ala He Ser Thr lie Ala Gin Gly Leu Glu Trp Ala 
Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 



115 



Pro ser Ala Thr Leu Glu Gin Ala Val Asn Tyr Ala Thr Ser Arg Gly 
130 135 



145 



val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 
Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
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Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp ire 
180 185 iS 

val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 

195 200 <U3 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Val 

210 215 220 



V 



225 



230 



Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 



245 



Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: S139A 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
! 5 10 1S 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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Thr Gly lie Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala :r»r 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

\ ' 85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ala Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 
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Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 

260 265 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

! . (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: S142T 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 4° 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 
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Glv Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
1 us 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Thr Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Ara Asn His Leu Lys Asn Thr Ala Thr Ser- Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN : Bacillus lentus DSM 584 3 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S139T 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly lie Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Glv Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
!15 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Thr Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 



SUBSTITUTE SHEET 



WO 92/21760 120 PCT/US92/04306 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Cly Leu Asp lie 
180 18: > 

val Ala Pro Gly val Asn val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 

19 5 200 * u= > 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
| 225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: I102W 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 

10 15 



1 5 



His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala Trp Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 
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Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

7 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: A96N 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Asn 

85 90 95 

Asp Gly Arg Gly Ala lie Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 
100 105 110 
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Gly Ash Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Glv 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 1 60 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: N42F 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Ala Gin Ser Val Pro Trp Gly lie ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 

► 

; His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly lie Ser Thr His Pro Asp Leu Phe lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala lie Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser lie Ser 
145 150 155 160 

4 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 
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Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 

180 185 2.90 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tvr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arq 
260 265 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE : 
(B) CLONE: S142A 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 



15 



His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 

20 25 — 



30 



SUBSTITUTE SHEET 



_9221760A1_I_> 



WO 92/21760 



126 



PCT/US92/04306 



Thr Gly lie Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ala Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He . 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 
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Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 

260 265 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: H118F 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
1 5 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser lie Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala lie Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 
100 105 110 
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Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160. 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 x / u 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 
' 245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
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(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: N237A 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Pro Trp Gly He Ser Ar 
5 10 



Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 

1 5 in 



15 



His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asd 
20 25 * 



30 



Thr Gly lie Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 42 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
3U 55 g 0 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser He Gly Val Le U 

75 80 

Gly val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 



95 



Asp Gly Arg Gly Ala He Ser Ser lie Ala Gin Gly Leu Glu Trp Ala 

105 110 



Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 

120 12 5 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 



140 



Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 

150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 
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Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
!95 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Ala Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: N255P 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Sly A la Ser 

4 0 ^ 5 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 

5 60 

His Val AL Gly Thr lie Al. Ala Leu Asn Asn Ser lie Gly val Leu 

75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

90 95 

Asp Gly Arg Gly Ala lie Ser Ser He Ala Gin n„ t 

100 7n= Gly Leu Glu T rP Ala 

io:> 110 

Gly Asn Asn Gly H.t His v.l Ala Asn Leu Ser Leu 0 i y Ser Pro s „ 

120 125 

Pro ser Ale Thr Leu Glu Gin Ale v.l Asn Ser Ala Thr Ser Ar, Gly 
V.l Leu v,l val A!a Ala Ser Gly Asn Ser Gly Ala ser ser He ser 

155 160 

Tyr Pro Ala Arg Tyr Al. Asn Ala Met Ala v.l Gly Ala Thr Asp Gin 

170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tvr G ] V »i a n T 

180 Tyr Gly Ala G1 y Leu Asp He 

185 190 

v.1 Al. pro Gly val Asn val Gin ser Thr Tyr Pro Gly ser Thr Tyr 
Al. ser Leu Asn Gly Thr ser Mat Al, Thr Pro Hi, Val Ala Gly Ala 

220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 

235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Pro Leu 



250 255 



SU 3ST?tu T E S H EE" 



760A1J_> 



WO 92/21760 PCT/US92/04306 

132 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(Vii) IMMEDIATE SOURCE: 

(B) CLONE: T141W, N237A 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly lie Ser Thr His Pro Asp Leu Asn lie Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 
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Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Trp Ser Ara Glv 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 1? 5 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

210 215 Ala Gly Ala 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Ala Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Ara 
260 265 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: T268V 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
15 10 is 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asd 
20 25 30 

Thr Gly lie Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Glv Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 7 ° 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 J 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 HO 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
US 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Ara Glv 
130 135 140 * 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin * 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 

18° 185 190 
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Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Val Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: K229W 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Ala Gin Ser Val Pro Trp Gly lie Ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 
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Thr Gly lie Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala 5er 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr lie Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 



Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

♦ 

Ala Ala Leu Val Trp Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 . 
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Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser^ftr Asn Leu 

250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Aro 
260 265 * 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN : Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: T141W 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Ala Gin ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 

10 15 



His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 

25 30 

Thr Gly 11, ser Thr Hi, Pro Asp Leu Asn He Arg Gly Gly A1 , Ser 

40 45 

Phe M Pro Gly Glu Pro Ser Thr Gl„ Asp Gly Asn Sly His Gly Thr 

60 

Bis val Al. Gly Thr lie Ala Ale Leu Asn Asn Ser He Gly Vel Le U 

75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 



90 95 
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Asp Gly Arg Gly Ala lie Ser Ser He Ala Gin Gly Leu -Glu Trp Ala 
100 105 HO 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Trp Ser Arg Gly 
130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asn Gin 

165 170 17 5 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asd He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serine Protease 

(B) STRAIN: Bacillus lentus DSM 5843 

(vii) IMMEDIATE SOURCE : 

(B) CLONE: wildtype 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 
15 10 is 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asd 
20 25 30 * 

Thr Gly lie Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 50 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 7S „ „ 



80 



Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 J 

Asp Gly Arg Gly Ala He Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 no 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 

Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
I 30 135 140 9 y 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Ser Ser He Ser 
145 I 50 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 

165 170 175 
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Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 

245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: S3T, V4I, A188P, V193M, V199I 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

GCGCAAACAA TCCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 
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TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 



GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 



GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 



AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 



TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CCCAGGGCTT GACATTATGG CACCAGGGGT AAACATTCAG 
600 



AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO:54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 



■ISDOCID; <WO 9221760A1J_> 



WO 92/21760 142 

(B) CLONE: S3T, A188P, V193H, V199I 



PCT/US92/043O6 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

GCGCAAACAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

. TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

. GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 



AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 



GCCAGCTTTT CACAGTATGG CCCAGGGCTT GACATTATGG CACCAGGGGT AAACATTCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 



GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 



CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSH 5483 

(Vii) IMMEDIATE SOURCE: 

(B) CLONE: V4I, A188P, V193M, V199I 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

GCGCAATCAA TCCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CCCAGGGCTT GACATTATGG CACCAGGGGT AAACATTCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 
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CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus OSM 5483 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: S139Y, A188P, V193M, V199I 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
X8 0 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

G6CGTA6CGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATTATGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
54 0 

GCCAGCTTTT CACAGTATGG CCCAGGGCTT GACATTATGG CACCAGGGGT AAACATTCAG 

600 
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AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 

660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
78 0 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HY.r >THETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: S130T, S139Y, A188P, V193M, V199I 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
6 0 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
18 0 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 
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AATTTGAGTT TAGGAAGCCC TTCGCCAACA GCCACACTTG AGCAAGCTGT TAATTATGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CCCAGGGCTT GACATTATGG CACCAGGGGT AAACATTCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(Vii) IMMEDIATE SOURCE: 

(B) CLONE: A188P, V193M. V199I 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 
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GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTG CAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CCCAGGGCTT GACATTATGG CACCAGGGGT AAACATTCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: S3T, A188P, V193M 
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(XX) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

GCGCAAACAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CCCAGGGCTT GACATTATGG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 54 8 3 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S157T 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

" GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

*TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAC ATCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 



GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 



CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 
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(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

fvi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: A188P, V193M 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACT ATACG CT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CCCAGGGCTT GACATTATGG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 



9221760A1 I > 



W0 92/2176i PCI7US92/04306 

151 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAACCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 63 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: A188P 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 
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ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAtf . <ft«ft«AGe 

tatccggccc gttatgcgaa cgcaatggca gtcggagcta ctgaccaaaa caacaaccgc 
gccagctttt cacagtatgg cccagggctt gacattgtcg caccaggggt aaacgtgcag 

600 

AGCACATACC CAGCTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 64: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: „ 1tnM 
(B) CLONE: S3T, V4I, A188P, V193M 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 64: 
GCGCAAACAA TCCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
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GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CCCAGGGCTT GACATTATGG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(via) IMMEDIATE SOURCE: 
(B) CLONE: V193M 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
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GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTCGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCQTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAG CT A CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTATGG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S104T 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

# 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 



TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGG AAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCA CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 67: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

*(vi) ORIGINAL SOURCE: 

\ (B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: T69V 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

3CGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

7TGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

!TAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
18 0 

;ggcatggca CGCATGTGGC CGGGGTTATT gctgctttaa ACAATTCGAT TGGCGTTCTT 
240 

IGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

CAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

ATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

CTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

ATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

CCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

3CACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 
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GTTGCAGGTG CACCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAi J CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: V4I, A188P, V193M 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

GCGCAATCAA TCCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCAT6GCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 
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TATCCGGCCC CTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CCCAGGGCTT GACATTATGG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: A224V 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 
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GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTACAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG TTGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: V199I 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 
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TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

3CCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACATTCAG 
600 

VGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

3TTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

:gcaaccatc taaagaatac GGCAACGAGC TTAGGAAGCA cgaacttgta TGGAAGCGGA 
780 

ZTTGTCAATG CAGAAGCGGC AACACGC 
807 

:2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: V4I 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:71: 

GCGCAATCAA TCCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAG CTTTGT A CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAG CTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 72: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentUS DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S3T 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

GCAAACAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

GACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

AAATATTC GTGGTGGCGC TAG CTTTGT A CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

CGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

AATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

TTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

TTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

CAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

CACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

TGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 
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CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN : Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S139Y 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 



GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 



AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATTATGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 
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TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACXaCCGC l 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY:' linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 54 83 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: N242A 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74: . 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTG CTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 
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GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
54 0 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 

CGCGCACATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL : NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: S236T 

(Xi) SEQUENCE DESCRIPTION: SIHQ ID N0:75: 
GCGC ££ TCAG TGCCATGGGG AATTAGCCGI GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
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TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGCGAAT 
180 

CGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGACAAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) topology: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iV) ANTI -SENSE: NO 
(Vi) ORIGINAL SOURCE: 
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(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S3 6A 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 



TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTGCAAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 



GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 



GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AG CAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 



TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAA CGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 



GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: H243A 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77: 

3CAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

3ACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

\AATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

3CATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

2GTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

VATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

rTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ITCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

PCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

IAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

^CATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

CGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 
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CGCAACGCAC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAA CTTGTA~TGG AAG CGGX~ 
78 0 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EONESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: A101T 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78: 
GCGCAATCAG TGCCATGGGG AATTAGCCGT GTG CAAGCCC CGGCTGCCCA TAACCGTGGA 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC . TCATCCAGAC 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGG 240 GGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
3 O 0 

ACAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 

*k mm V 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
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GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: S236A 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 
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GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT * GCAC'dTlfccT 

3 S 0 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
48 0 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
54 0 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCA £J TACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGGCAAA TGTACAAATC 
CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 

780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE. TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: E87R 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
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TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

3GGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
24 0 

3GCGTAGCGC CTAGTGCGCG TCTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 

3CAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

VATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ICTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

?ATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
54 0 

iCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

•GCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

TTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
GC ^o CATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 

TTGTCAATG CAGAAGCGGC AACACGC 
807 

2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE:' 

(B) STRAIN: Bacillus lentus DSM 5483 
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(vii, IMMEDIATE SOURCE: PCT/US92/04306 
(B) CLONE: N114S 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 

TTAAATATTC GTCGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA GCAATGGCAT GCACGTTGCT 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 

ACTTCTAGAG GCGTTCTTCT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSN 5483 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: A47W 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

3CAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

3ACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

AAATATTC GTGGTGGCTG GAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

3CATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

CGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

AATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

rTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

rTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

IAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

3ACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

PGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

2AACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 
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CTTGTCAATG CAGAAGCGGC AACACGC 
807 
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(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE : 
(B) CLONE: A120S 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
TTCACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTC* AGATGGGAAT 
GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
CCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTAGC 
AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
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GCCAGCTTTT CACACTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT * AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
7 2 0 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: .nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: T56V 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCGTTCA AGATGGGAAT 
GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
3C& 360 AGCT CGATTGCCCA AGG GTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
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AATTTGAGTT TACGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: A120V 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
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TTAAATATTC GTGGTGGCGC TACCTTTGTA CCAGGGGAAC CATCCACT^ TgI^AT 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 

GGCGTAGCGC ctagtccgga ACTATACGCT gttaaagttt TAGGAGCCGA CGGTAGAGGT 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGTT 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAA7T CAGGTGCAAG CTCAATCAGC 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEONESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 
(Vii) IMMEDIATE SOURCE: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 

AGCACATACC CAGTTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S130A 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 

rTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 

PTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 

3GGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 

|3GCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 

3CAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 

iATTTGAGTT TAGGAAGCCC TTCGCCAGCA GCCACACTTG AGCAAGCTGT TAATAGCGCG 

ICTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 

[^ATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 

pCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 

.GCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 

TTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 

GCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
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CTTGTCAATG CAGAAGCGGC AACACGC PCT/US92/04306 
807 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S13 0T 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:88: 
GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
AATTTGAGTT TAGGAAGCCC TTCGCCAACA GCCACACTTG AGCAAGCTGT TAATAGCGCG 
ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACOAAAA CAACAACCGC 
aaa SS an CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
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VGC J60 TACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
5TTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
:gcaaccatc TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 

-TTGTCAATG CAGAAGCGGC AACACGC 
807 

2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: A96I 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
2GCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 

TGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 

PAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 

3GCATGGCA CGCATGTGGC CGGGACGATT CCTGCTTTAA ACAATTCGAT TGGCGTTCTT 

3CGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAATTGA CGGTAGAGGT 

^ATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
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AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGC^^AMAGOSCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 

7 2 0 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: S104T, S139Y, A224V 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
6 0 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
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184 

JGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

:GC GTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 

;CAATCAGCA CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 

ATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATTATGCG 

-CTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

'ATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 

CCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 

GCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

TTSCAGGTG TTGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 

GC 780 CATC TAAA6AATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 

TTGTCAATG CAGAAGCGGC AACACGC 
807 

) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO* 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(Vii) IMMEDIATE SOURCE: 
(BJ CLONE: S139A 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATGCAGCG 
ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
GCCAGCITTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(ill) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: S142T 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 
JCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 

:TGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
TAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
GGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
GCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
CAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
ATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
CTACAAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
ATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
CCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
GC 660 TACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
TTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
JCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 

rTGTCAATG CAGAAGCGGC AACACGC 
807 
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(2) INFORMATION FOR SEQ ID NO: 93: PCT/US92/04306 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE .. 3URCE: 
(B) CLONE: S139T 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATACAGCG 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
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3TTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

TGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

rTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: I102W 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 : 

[3CGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
6 0 

I .TGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 

|iTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 

5GGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
24 O 

iGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 

JCATGGAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 

ATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 
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ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: A96N 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCACAC 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
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JGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAAACGA CGGTAGAGCT 

JCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
3 60 

ATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 

-CTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

ATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
CCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 

DUO 

GCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

TTCCAGCTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 

' CC tJS CATC taaagaatac ggcaacgagc TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 

TTGTCAATG CAGAAGCGGC AACACGC 
807 

2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEONESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: N42F 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
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GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCC^TAACCG^c n A 

6 0 

TTG 120 GGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 

TTATTTATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
X8 0 

GGG 240 GGCA CGCATGTGGC CGGGA CGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
GGC 300 GCGC CTAGTGCGGA ACTA TACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 



GCAATCAGCT 
360 


CGATTGCCCA 


AGGGTTGGAA 


TGGGCAGGGA 


ACAATGGCAT 


GCACGTTGCT 


AATTTGAGTT 
420 


TAGGAAGCCC 


TTCGCCAAGT 


GCCACACTTG 


AGCAAGCTGT 


TAATAGCGCC 


ACTTCTAGAG 
480 


GCGTTCTTGT 


TGTAGCGGCA 


TCTGGGAATT 


CAGGTGCAAG 


CTCAATCAGC 


TATCCGGCCC 
540 


GTTATGCGAA 


CGCAATGGCA 


GTCGGAGCTA 


CTGACCAAAA 


CAACAACCGC 


GCCAGCTTTT 
600 


CACAGTATGG 


CGCAGGGCTT 


GACATTGTCG 


CACCAGGGGT 


AAACGTGCAG 


AGCACATACC 
660 


CAGGTTCAAC 


GTATGCCAGC 


TTAAACGGTA 


CATCGATGGC 


TACTCCTCAT 


GTTGCAGGTG 
720 


CAGCAGCCCT 


TGTTAAACAA 


AAGAACCCAT 


CTTGGTCCAA 


TGTACAAATC 


CGCAACCATC 
780 


TAAAGAATAC 


GGCAACGAGC 


TTAGGAAGCA 


CGAACTTGTA 


TGGAAGCGGA 


CTTGTCAATG 


CAGAAGCGGC 


AACACGC 









807 

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO PCT/US92/04306 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus OSH 5483 

(vii) IMMEDIATE SOURCE : 
(B) CLONE: S142A 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
ACTGCAAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
GCC^GCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
^^So^ 00 CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 99: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) topology : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: H118F 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 

TTGACAGGTT CTGGTGTAAA AGTTG CTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GTTTGTTGCT 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
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CTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTAC?lXATC' 
720 

3CAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

PTGTCAATG CAGAAGCGGC aacacgc 
807 

!) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: N237A 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

X3CAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

"•GACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

'AAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

JCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
300 

-'AATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

-TTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

TTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 
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TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCGC TGTACAAATC 
720 

CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: N255P 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 



GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTCTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTCTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GG6CATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
24 0 
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TTG 120 CGTT CTGGTGTAAA AGTT GCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 

180 

GGG 240 GGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

TGGTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 

480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCGC TGTACAAATC 
CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: T268V 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 

A *9 %J 

GGCGTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 
GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
AGC t60 TACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
GTTCCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 

CTTGTCAATG CAGAAGCGGC AGTTCGC 
807 

(2) INFORMATION FOR SEQ ID NO: 104: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSN 5483 

(Vii) IMMEDIATE SOURCE: 
(B) CLONE: K229W 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

GCGCAATCAG TGCCATGGGG AATTAG CCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
6 0 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
120 

TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
180 

GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
240 

GGC 300 GCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTCCAG 
600 

AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

GTTGCAGGTG CAGCAGCCCT TGTTTGGCAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 
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CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEONESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus DSM 5483 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: T141W 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
60 

TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
GGG ^JJ GGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
GGCCTAGCGC CTAGTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA CGGTAGAGGT 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
TGGTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 

450 



<WO 9221760A1_I_> 



WO 92/21760 201 

TATCCGGCCC GTTATCCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAaT J CAA^, c 
GCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTCCAG 
AGCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
GTTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
CGCAACCATC TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 

CTTGTCAATG CAGAAGCGGC AACACGC 
807 

(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) organism: Bacillus lentus 

(B) STRAIN: DSM 5483 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: wild type 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
GCGCAATCAG TGCCATGGGG AATTAGCCGT GTGCAAGCCC CGGCTGCCCA TAACCGTGGA 
TTGACAGGTT CTGGTGTAAA AGTTGCTGTC CTCGATACAG GTATTTCCAC TCATCCAGAC 
TTAAATATTC GTGGTGGCGC TAGCTTTGTA CCAGGGGAAC CATCCACTCA AGATGGGAAT 
GGGCATGGCA CGCATGTGGC CGGGACGATT GCTGCTTTAA ACAATTCGAT TGGCGTTCTT 
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GGCGTAGCGC CTACTGCGGA ACTATACGCT GTTAAAGTTT TAGGAGCCGA" CGG"TA5AG~T 
300 

GCAATCAGCT CGATTGCCCA AGGGTTGGAA TGGGCAGGGA ACAATGGCAT GCACGTTGCT 
360 

AATTTGAGTT TAGGAAGCCC TTCGCCAAGT GCCACACTTG AGCAAGCTGT TAATAGCGCG 
420 

ACTTCTAGAG GCGTTCTTGT TGTAGCGGCA TCTGGGAATT CAGGTGCAAG CTCAATCAGC 
480 

TATCCGGCCC GTTATGCGAA CGCAATGGCA GTCGGAGCTA CTGACCAAAA CAACAACCGC 
540 

SCCAGCTTTT CACAGTATGG CGCAGGGCTT GACATTGTCG CACCAGGGGT AAACGTGCAG 
600 

\GCACATACC CAGGTTCAAC GTATGCCAGC TTAAACGGTA CATCGATGGC TACTCCTCAT 
660 

3TTGCAGGTG CAGCAGCCCT TGTTAAACAA AAGAACCCAT CTTGGTCCAA TGTACAAATC 
720 

:gcaaccatc TAAAGAATAC GGCAACGAGC TTAGGAAGCA CGAACTTGTA TGGAAGCGGA 
780 

2TTGTCAATG CAGAAGCGGC AACACGC 
807 
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What is claimed is: 

1. A mutant Bacillus lentus DSM 5483 protease derived by 
the replacement of at least one amino acid residue of the 
mature form of the Bacillus lentus DSM 5483 alkaline 
protease shown in SEQ ID NO: 52 wherein said one amino acid 
residue which is selected from the group consisting of 
Ser3, val4, Ser36, Asn42, Ala47, Thr56, Thr69, Glu87 
Ala96, AlalOl, 11.102, Serl04 f Asnll4, Hisll 8/ Alal2o' 
Serl30 f Serl39, Thrl41, Serl42, Serl57, Alal 88/ Vall93 ' 
Vall99, Gly20 5/ Ala224, Lys22 9/ Ser236, Asn237, Asn242 ' 
H1S243, Asn255, Thr268 is replaced with the amino acid 
residues listed in Table 2. 

2. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the valine residue at position 199 is substituted 
by isoleucine, the valine residue at position 193 is 
substituted by methionine, the alanine residue at position 
188 is substituted by proline, the valine residue at 
position 4 is substituted by isoleucine, and the serine 
residue at position 3 is substituted by threonine. 

3. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the valine residue at position 199 is substituted 
by isoleucine, the valine residue at position 193 is 
substituted by methionine, the alanine residue at position 
188 is substituted by proline, and the serine residue at 
position 3 is substituted by threonine. 

4. A mutant Bacillus lentus DSM 5483 protease of claim l 
wherein the valine residue at position 199 is substituted 
by isoleucine, the valine residue at position 193 is 
substituted by methionine, the alanine residue at position 
188 is substituted by proline, and the valine residue at 
position 4 is substituted by isoleucine. 

5. A mutant Bacillus lentus DSM 5483 protease of claim l 
wherein the valine residue at position 199 is substituted 
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by isoleucine, the valine residue at position 193 
substituted by methionine, the alanine residue at position 
188 is substituted by proline, and the serine residue at 
position 139 is substituted by tyrosine. 

6. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the valine residue at position 199 is substituted 
by isoleucine, the valine residue at position 193 is 
substituted by methionine, the alanine residue at position 
188 is substituted by proline, the serine residue at 
position 139 is substituted by tyrosine and the serine 
residue at position 130 is substituted by threonine. 

7. A mutant Bacillus lentus DSM 5483 protease of claim l 
wherein the valine residue at position 199 is substituted 
by isoleucine, the valine residue at position 193 ± B 
substituted by methionine, the alanine residue at position 
188 is substituted by proline. 

20 8. a mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the valine residue at position 193 is substituted 
by methionine , the alanine residue at position 188 is 
substituted by proline, and the serine residue at position 
3 is substituted by threonine. 



9. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the serine residue at position 157 is substituted 
by threonine. 



10. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the valine residue at position 193 is substituted 
by methionine, and the alanine residue at position 188 is 
substituted by proline. 

11. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the alanine residue at position 188 is substituted 
by proline. 
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12. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the valine residue at position 193 is substituted 
by methionine, the alanine residue at position 188 is 
substituted by proline, the valine residue at position 4 is 
5 substituted by isoleucine and the serine residue at 

position 3 is substituted by threonine. 

13. A mutant Bacillus lentus DSM 5483 protease of claim 1 

10 b™-^ reSidUe ^ POSiti ° n " 3 " s *-tituted 

io t>y methionine. . 

14. A mutant Bacillus lentus DSM 5483 protease of claim i 
wherein the serine residue at position 104 is substituted 
by threonine. 
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15. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the threonine residue at position 69 is substituted 
by valine, 

16. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the valine residue at position 193 is substituted 
by methxonme, and the alanine residue at position 188 is 
substituted by proline, and the valine residue at position 
4 is substituted by isoleucine. 

17. A mutant Bacillus lentus DSM 5483 protease of claim l 
wherein the alanine residue at position 224 is substituted 
by valine. 

18. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the valine residue at position 199 is substituted 
by isoleucine. 

19. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the valine residue at position 4 is substituted by 
isoleucine. J 
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A mutant Bacillus l e „tus DSM 5483 protease of ^ • 

5 21. A mutant Bacillus lentus DSM 5483 nr n t« 

wherein the serine residue at piiVto^T \ ^ * 
by tyrosine. 139 is substituted 

22. a mutant Bacillus lentus DSM 5483 
10 wherein » . Protease of claim l 

wnerexn the asparagine residue at r>o s ^ 
substituted by alanine. Position 242 is 

23. A mutant Bacillus lentus DSM 5483 protease of e i.- , 
wherein the serine residue at position 2 T 6 * ^ 1 

5 by threonine. Position 236 is substituted 

24. a mutant Bacillus lentus DSM 5483 bt-oi-^ 

wherein the serine residue at position 36 ***** ' 

alanine. Position 36 is substituted by 

» 

25. A mutant Bacillus lentus DSM 5483 

wherein the histidine residue It I T °* ^ 1 
substituted by alanine. * P ° Sltl ° n 243 is 



26. a mutant Bacillus lentus DSM 5483 

wherein the a i a „ {nB ^ ^ Protease of claim l 

tne alanine residue at oosi mi • 
by threonine. Position 101 is substituted 

27. a mutant Bacillus lentus DSM 5483 protease of «, « 
wherein the serine residue at position 2 7« 1 
by alanine. Position 236 is substituted 



28. a mutant Bacillus lentus DSM 5483 

wherein the glutamic acid residue ^ at positi * 
substituted by arginine Position 87 is 
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A mutant Bacillus lentus DSM 5483 protease of claim 1 

m the asparagine residue at position 114 is 

substituted by serine. 

30. A mutant Bacillus lentus DSM 5483 protease of claim l 
wherein tha alanine residue at position 47 is substituted 
by tryptophan. I ™ tM 

31. * mutant Bacillus lentus DSM 5483 protease of claim l 
by'serinl! 16 "* P ° Sition 120 iK =*»tituted 



32. A mutant Bacillus lentus DSM S483 protease of claim 1 
. Z7£ n T threOTi ' >e " SidUe " P ° SitiOT " 18 ^^«ed 

33. A mutant Bacillus lentus DSM 5483 protease of claim 1 

ZZZ™' alanin " resldue " position 120 ls —*««^ 

34. A mutant Bacillus lentus DSM 5483 protease of claim 1 
by^lL"" 6 9lyCiM rMldUe " P0Slti ° n 205 is 

«. A mutant Bacillus lentus DSM 5483 protease of daim x 
" rlne ~— " — "° - substituted 1 

36. A mutant Bacillus lentus DSM 5483 protease of claim x 

rrzonte.^ rMidu * - — - - — : 

vner.in^„r\ BaCmUS '""^ ° SM 84,3 Pr ° tM " ° f «*^» - 
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38. A mutant Bacillus lentus DSM 5483 protease of claim l 
wherein the alanine residue at position «4 is substituted 
by valine, the serine residue at position iTTs 
substituted by threonine, and the serine residue at 
position 139 is substituted by tyrosine. 

39. A mutant Bacillus lentus DSM 5483 protease of claim i 

serlM residue at p ~ itiOT »* - ---ed 1 

40- A mutant Bacillus lentus DSM 5483 protease of claim l 
r^ont." riM r ~" M " — " - - 

5 41. A mutant Bacillus lentus DSM 5483 protease of claim l 

_„v,„. . . . , , Auue ac position 102 is 

substituted by tryptophan. 



wher.Ltral. a ° U1US ^ ° f 1 

by a^a^e ^ " »« *" -*•«*«*- 

wnerefn 1 " 1 ^ Ba ° U1US DSM Mt protM " « 11. 1 

wherein the asparagine residue at position 42 is 

substituted by phenylalanine. " 
45. A mutant Bacillus lentus DSM 5483 protease of claim l 

46. A mutant Bacillus lentus DSM 5483 protease of , 
wherein the histidin. residue at position A fi 
substituted by phenylalanine. P°«t«n us ls 
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47. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the asparagine residue at position 237 is 
substituted by alanine. 

48. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the asparagine residue at position 255 i s 
substituted by proline. 
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49. A mutant Bacillus lentus DSM 5483 protease of claim l 
wherein the asparagine residue at position 237 is 
substituted by alanine and the threonine residue at 
position 141 is substituted by tryptophan. 

50. A mutant Bacillus lentus DSM 5483 protease of claim l 
wherein the threonine residue at position 268 is 
substituted by valine. 

51. A mutant Bacillus lentus DSM 5483 protease of claim l 
wherein the lysine residue at position 229 is substituted 
by tryptophan. 



5 



52. A mutant Bacillus lentus DSM 5483 protease of claim 1 
wherein the threonine residue at position 14 1 is 
substituted by tryptophan. 

53. A mutant gene which encodes for a mutant Bacillus 
lentus DSM 5483 protease comprising in the direction of 
transcription a promoter, ribosomal binding site, 
initiation codon and the major portion of the pre region of 
the Bacillus licheniformis ATCC 53926 alkaline protease 
gene operably linked to a portion of the pre region and all 
of the pro and mature regions of the Bacillus lentus DSM 
5483 alkaline protease gene wherein one or more codons of 
said Bacillus lentus DSM 5483 alkaline protease gene are 
altered to produce a mutant gene which encodes for a 
protease derived by the replacement of at least one amino 
acid residue of the mature form of the Bacillus lentus DSM ~ 
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5483 alkaline protease wherein said one amino acid residue 
which is selected from the group consisting of Ser3, Val4, 
Ser36, Asn42, Ala47, Thr56, Thr69, GluB7, Ala96, AlalOl, 
Ilel02 f Serl04, Asnll4, Hisll8, Alal20, Serl30, Serl39, 
5 Thrl41, Serl42, Serl57, Alal88, Vall93, Vall99, 617205, 

Ala224, Lys229, Ser236, Asn237, Asn242, His243, Asn255, 
Thr268 is replaced with the amino acid residues listed in 
Table 2. 

L0 54. The mutant gene of claim 53 which encodes for said 

M 

mutant protease wherein the valine residue at position 199 
is substituted by isoleucine, the valine residue at 
position 193 is substituted by methionine, the alanine 
residue at position 188 is substituted by proline, the 
.5 valine residue at position 4 is substituted by isoleucine, 

and the serine residue at position 3 is substituted by 
threonine . 

55. The mutant gene of claim 53 which encodes for said 
0 mutant protease wherein the valine residue at position 199 

is substituted by isoleucine, the valine residue at 
position 193 is substituted by methionine, the alanine 
residue at position 188 is substituted by proline, and the 
serine residue at position 3 is substituted by threonine. 

5 

56* The mutant gene of claim 53 which encodes for said 
mutant protease wherein the valine residue at position 199 
is substituted by isoleucine, the valine residue at 
position 193 is substituted by methionine, the alanine 
0 residue at position 188 is substituted by proline, and the 

valine residue at position 4 is substituted by isoleucine. 

57, The mutant gene of claim 53 which encodes for said 
5 mutant protease wherein the valine residue at position 199 

is substituted by isoleucine, the valine residue at 
position 193 is substituted by methionine, the alanine 
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residue at position 188 is substituted by proline, and the 
serine residue at position 139 is substituted by tyrosine. 

58. The mutant gene of claim 53 which encodes for said 
5 mutant protease wherein the valine residue at position 199 
is substituted by isoleucine, the valine residue at 
position 193 is substituted by methionine, the alanine 
residue at position 188 is substituted by proline, the 
serine residue at position 139 is substituted by tyrosine 
10 and the serine residue at position 130 is substituted by 
threonine . 



59. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the valine residue at position 199 
15 is substituted by isoleucine, the valine residue at 
position 193 is substituted by methionine, the alanine 
residue at position 188 is substituted by proline. 



20 



60. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the valine residue at position 193 
is substituted by methionine , the alanine residue at 
position 188 is substituted by proline, and the serine 
residue at position 3 is substituted by threonine. 

25 61. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the serine residue at position 157 
is substituted by threonine. 

62. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the valine residue at position 193 
is substituted by methionine, and the alanine residue at 
position 188 is substituted by proline. 

63. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the alanine residue at position 188 
is substituted by proline. 



30 
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The mutant gene of claim 53 which encodes for said 
mutant protease wherein the valine residue at position 193 
is substituted by methionine, the alanine residue at 
Position 188 is substituted by proline, the valine residue 
at position 4 is substituted by isoleucine and the serine 
residue at position 3 is substituted by threonine. 

65. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the valine residue at position 193 
is substituted by methionine. 

66. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the serine residue at position 104 
is substituted by threonine. 

67. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the threonine residue at position 
69 as substituted by valine. 

68. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the valine residue at position 193 
xs substituted by methionine, and the alanine residue at 
position 188 is substituted by proline, and the valine 
residue at position 4 is substituted by isoleucine. 

69. The mutant gene of cl aia 53 which encodes for said 
mutant protease wherein the alanine residue at position 224 
is substituted by valine. 

70. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the valine residue at position 199 
as substituted by isoleucine. 

71. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the valine residue at position 4 is 
substituted by isoleucine. 
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72. The mutant gene of ei«i» « 

-t-t protease vnerein tbe ^ 'T^ ** 

substituted by threonine. " P ° sltion 3 " 



35 



73. The mutant gene of ela-i™ ^. 

«-* protease vLein^eVeVn Z^T^T .f~ — 
as substituted by tyrosine. Position 139 

74. The mutant gene of claim 53 which M „ 

mutant protease wherein the asparaJn ftt Said 

242 is substituted by alanL! ? ""^ " P ° Siti °" 

75. The mutant gene of claim 53 wh <^ 

mutant protease wherein the . enC ° < * eS * ° r said 

wnerexn the serine residue a*- n«^. 
is substituted by threonine. Position 236 

76. The mutant gene of claim 53 which » . 

mutant protease wherein th. . encodes for said 

is substituted by awne ^ * 36 

243 is substituted by alanine at P ° Sition 



25 78. 



78. The mutant gene of claim s-? u 

mutant protease wherein thl T for said 

»e wnerem the alanine residue ai- n „ e j4., 
substituted by threonine. Position 101 



wwerem the serine residue «+• 
is substituted by alanine. Position 23 6 

Position s 7 i. substitute, by " 
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81. The mutant gene of claim „u ■ ^ 

-tant please wlrei n t^^J^ 

H4 is substituted by serine. * P ° siti °* 

5 82. The mutant gene of e i»i™ r^, 

-ant protease -rein^^^ — 
18 s *>stituted *Y tryptophan. Potion 47 
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83. The mutant gene of claim 53 which *n™„ 
-tant protease wherein the alanine r^LTTZ^ 
is substituted by serine. Position 120 



84. The mutant gene of claim 53 which en eori * 
is substituted by valine St P OSiti °n 120 

) 

^rZT^iZ^ii: 53 - ich — - - — 

1= substitute* by 9 ^ reSidM at *° Siti °» 

I 25 87- The mutant gene of claim S3 vnieh encort . 

r^: prot "- — -1- B . rine rX. ,t ;„ = si ! or said 

substituted by alanine. Position 130 

^^IZ^t 53 — - — 

is substitute* ^ ZZvTn. ^ ^ rMldUe " POSl "° n 96 
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9 °- The autant gene of clai™ «, . 

herein the * *<* ««— for said 

- substituted ey valln ., ^ a ; e n ;. re " due « Portion 2U 

SI. The autant gene of C lai„ «, 

-ta» t prot . ase v « „ hloh „ cedM fw 

- substituted b y alanine. « «>°sition l 39 

92 • 1118 """tent gen. of olaia », 

-tant protease wherein the s . r "e ^ — — f « "aid 
" sub «**ut.d by threonine. at P«"ion i„ 

93 • Th » ""'ant gene of e iai„ „ 
-tent protease wherein the s e " 

" '^'"tuted by threonine. " 139 

94 • The mutant- 

utant gene of 

-«tant protease wherein tn. iso i anC ° des f « said 

" 2 18 ^"ituted b y t^an " e " 



9S- The mutant gene of • 

»utant protease wherein the alan!n ** 

- substituted by asparagt'e ' r " ldUe " « 



9 6. The 



- mutant gene of ro • 
»«tant protease wherein th e • nooa « *>r said 

« I- substituted b y Phe„^aw;r ne " 

9 7. The mutant gene of m*- 

-tant protease wherein th e «* said 
" -^stituted b y alanlna . Serine ««— at position i« 

96 • The "»"tant g.„. ef ola±m B , 

-tant protease wherein th ~— -' -X- 

" 8 " ^"ituted b y phennalaVine """" " P ° Si "°" 
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99. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the asparagine residue at position 
237 is substituted by alanine. 

100. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the asparagine residue at position 
255 is substituted by proline. 

101. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the asparagine residue at position 
237 is substituted by alanine and the threonine residue at 
position 141 is substituted by tryptophan. 

102. The mutant gene of claim 53 which encodes for said 
L5 mutant protease wherein the threonine residue at position 

268 is substituted by valine. 

103. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the lysine residue at position 229 

0 is substituted by tryptophan. 

104. The mutant gene of claim 53 which encodes for said 
mutant protease wherein the threonine residue at position 
141 is substituted by tryptophan. 

105. A hybrid plasmid capable of replication in Bacillus 
comprised of a gene which encodes for a mutant Bacillus 
lentus DSM 5483 protease comprising in the direction of 
transcription a promoter, ribosomal binding site, 
initiation codon and the major portion of the pre region of 
the Bacillus licheniformis ATCC 53926 alkaline protease 
gene operably linked to a portion of the pre region and all 
of the pro and mature regions of the Bacillus lentus DSM 
5483 alkaline protease gene followed by a 164 bp DNA 
fragment containing the transcription terminator from the 
ATCC 53926 alkaline protease gene wherein one or more 
codons of said Bacillus lentus DSM 5483 alkaline protease 
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5483 alkaline protease wherein said one a»ino acid residue 
which is selected fro* the group consisting or S er3 Vail 

Xle 10J , s.r 10 4, *sn 114 , Hisue, Aia^o, 

mrl«, serl42, S.rl57, Alales ,.,„, „ f ' Se " 39 ' 

"a 22 «, L y« 29 . S er*3e, ^ Znll ' 

= is replaced with the a»ino 



106. The hybrid plasmid of clai* 105 wherein s *i« <- 
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109. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the valine 
residue at position 199 is substituted by isoleucine, the 
valine residue at position 193 is substituted' by 
5 methionine, the alanine residue at position 188 is 
substituted by proline, and the serine residue at position 
139 is substituted by tyrosine. 

110. The hybrid plasmid of claim 105 wherein said mutant 
10 gene encodes for said mutant protease wherein the valine 
residue at position 199 is substituted by isoleucine, the 
valine residue at position 193 is substituted by 
methionine, the alanine residue at position 188 is 
substituted by proline, the serine residue at position 139 
is substituted by tyrosine and the serine residue at 
position 130 is substituted by threonine. 

111. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the valine 
residue at position 199 is substituted by isoleucine, the 
valine residue at position 193 is substituted by 
methionine, the alanine residue at position 188 is 
substituted by proline. 



15 



112. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the valine 
residue at position 193 is substituted by methionine , the 
alanine residue at position 188 is substituted by proline, 
and the serine residue at position 3 is substituted by 
threonine. 



113. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the serine 
residue at position 157 is substituted by threonine. 

114. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the valine 
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proline. P°«t„n 1M is substitutad by 

H5. The hybrid plasaid of clai» ,„ t . 
sen. encodes for said -u Jnt pVo t et » Utant 

a t P o Sitl0 n Ui is 1^:;;^^ iUnine 

116. The hybrid plasmid of clai. 

sen. encodes for said mutant Dr ! t ~" -*«* 

residue at position 

Uahin. residue at position a^it s^stit «- 
toe valine residue at posit' SUbstltute<a proline, 
isoleucine and the serl „e 4 " Sttbs tituted by 

substituted by threonine " P ° Siti °" 3 is 

117. The hybrid plasmid of cl ai„ ,„ „ 

Sen. encodes for .aid mutant /\ eln ffluti ">t 

residue at position ^TZ^l ^ 

tltuted b y methionine. 
118. The hybrid plasmid of claim ,„ B . 

gene encodes for said mutant / ****** Said ffiuta *t 
residue at position in* «. 2*1 **** Wherein 

serine 

S subst ituted by threonine. 
119. The hybrid plasmid of claim m* k 

gene encodes for said mutant nr ot fflUtant 
— e at position 69 .T^^t"^ 
120. Th. hybrid plaslnl 

gene encodes f or said mutant ! ♦ " ln " id "*«* 

"sidue at position 1S3 Is SU b " ™»~ 
the alanine residue at positl " * TOthio »^. — 
P»lihe, and the vaiin. r^u "J ' T^**- * 
substituted by isoleucine. Position 4 is 
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121. The hybrid plasmid of claim . 

«- encodes for said ^J ^el ^ ^ 
residue at positlon 224 is J^*^ ^™ n r alinine 

125. The hybrid plasmid of clai™ ,n* w 

gene encodes for said nulLt Said » ut *«t 

alanine. P sltlon 242 xs substituted by 

127. The hybrid plasmid of clai™ 1ne . 

gene encodes for said mutL Whereln Sa±d » uta "t 

129. Zh. hybrid plasmid of claij. ,„ B . 
gene encodes for said nut a„ t ^ Sa " nUt »* 

-due a t P o s i tion 2t3 TJZ££Z%2T»«- . 

SUBSTITUTE SHEET 



WO 92/21760 



PCT/US92/04306 



221 



"c. The hybrid placid of eUi, 105 wherein ,.•„ 
5m " f « "« "utent protease ^ere " T 

5 131 " ** hybrld P 1 "-" of ciaia 108 wherein „•„ 

Sene e„ oodes fot Eaid Butant autant 

residue at posits 236 ls substituted 

132. The hybrid plasnid of clai» ^ 
0 9 ene erodes for said -tant^e ^1"^^ 
-d -idue at position 87 is .^Z^y 

» «Para gine residue at posTttn xH"^" Wherel " 

serine. P°*«xon 114 ls substituted fcy 



134. The hybrid plasmid of claim n« R w 

gene encodes for said «uT * Said muta *t 

— e at ^"rrtzz-z^r ^ 
at 

£ ^^'^1,--- — .utaht 

* --tU^l^^^ —he 
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IJ,: T " " °* Cla±m 105 whe " in -M mutant 
gene encodes for ..id mutant prote.se wherein the serine 
rescue at position 130 is substituted by alaninT. 

140. The hybrid pl.smid of claim 105 wherein * 
gene encodes for said mutant protease £ " 

residue at position 130 is substituted by 4^ 

141. The hybrid plasmid of claim 105 whe>-„ ( „ . 

gene encodes for said mutant protease wherein "e ^ 
residue at position SS is substituted by ^oWiL ^ 

142. The hybrid plasmid of claim 105 wherein s ,<* » 
gene encodes for said mutant protease wnerTin Z 
residue at position 22 < is substituted^ vTlin. 
serine residue at position 139 is substituted Z T' 
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143. The hybrid plasmid of claim 105 wherein 

gen. encodes for said mutant prote.se ^"^ "1 2 

residue at position 13* is substituted by atanin^ ^ 

144 The hybrid plasmid of claim 105 wherein said mutant 
gen. encodes for said mutant protease wherein th7 T 
residue at position 14. is substituted by ^nt,. 

get encJT? °* ^ " 5 -tant 

gene encodes for said .utot protease wherein the ser^! 

rescue at position 13 9 is substituted by threonTne. 

sssn -due at positr rr— ed 
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147. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the alanine 
residue at position 96 is substituted by asparagine. 

148. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the 
asparagine residue at position 42 is substituted by 
phenylalanine . 

14 9. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the serine 
residue at position 142 is substituted by alanine. 

150. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the histidine 
residue at position 118 is substituted by phenylalanine. 

151. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the 
asparagine residue at position 237 is substituted by 
alanine. 

152. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the 
asparagine residue at position 255 is substituted by 
proline. 



153. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the 
asparagine residue at position 237 is substituted by 
alanine and the threonine residue at position 141 is 
substituted by tryptophan. 

154. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the threonine 
residue at position 268 is substituted by valine. 
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155. The hybrid plasmid of claim los wherein said mutant 
gene encodes for .aid mutant protease wherein the lysine 
residue at position 229 is substituted by tryptophan. 

5 156. The hybrid plasmid of claim 105 wherein said mutant 
gene encodes for said mutant protease wherein the threonine 
residue at position 141 is substituted by tryptophan. 

157. A computer based method for identifying the sites 
' which affect the stability of a target proLn cZrisLg 
the steps of: (i, generating a probe-accessible surface of 

cool^T^ Pr ° tein ^ Pr ° binB 0,6 i«l 
coordinates of said protein with an uncharged probe 

molecule having a radius of about 0.9 to about 2 0 A, 
wherein said probe-accessible surface has an external 
surface the interior of which contains one or more probe- 
accessible internal cavities, (2, identifying the amino 
acid, which make up the boundaries of the internal 
cavities, wherein said amino acids comprise a set of sites 
which when mutated increase the stability of the protein. 

Ml. The computer based method of claim 157 wherein said 
P«tease Pr0taln *" ^ »" 

ZL X ; °T ter based methoii for s it es 

wbich affect the stability of a target protein comprising 
the steps of: fl, generating a probe-accessible surface of 

"ordi!T et T*** ^ Pr0b1 " 9 toe th ^ dimensional 
coordinates of said protein with an uncharged probe 
molecule having a radius of about «., to about 2 o A 
wherein said probe-accessible surface has an eternal 
surface the interior of which contains on. or more probe- 
accessible internal cavities, (2, aligning said Lree 
dimensional coordinate of said target protein *Z a 

coorirr Pr ° teln ^ B ° Vin9 ««• «»ensional - 

coordinates of said reference protein into the coordinate 
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frame of said target protein; (3) identifying an amino acid 
in said reference protein whose side chain lies outside 
said solvent-accessible surface of said protein or inside 
said internal cavities of said target protein. 

5 

160. The computer based method of claim 159 wherein said 
target protein is Bacillus lentus DSM 5483 alkaline 
protease. 

161. The computer based method of claim 159 wherein said 
reference protein is any protein for which a three 
dimensional structure is available which is homologous to 
the target protein. 



15 162. The computer based method of claim 159 wherein said 
reference protein is thermitase. 

163. The computer based method of claim 159 wherein said 
reference protein is subtilisin Carlsberg. 

20 

164. The computer based method of claim 159 wherein said 
reference protein is subtilisin BPN». 

165. The computer based method of claim 159 wherein said 
25 reference protein is proteinase K. 

166. A. computer based method for increasing the stability 
of a protein comprising the steps of: (i) generating a 
probe-accessible surface of said target protein by probing 
the coordinates of said protein with an uncharged probe 
molecule having a radius of about 0.9 to about 2.0 A 
wherein said probe-accessible surface has an external 
surface the interior of which contains one or more probe- 
accessible internal cavities; (2) identifying the amino 

35 acids which make up the boundaries of the internal 

cavities, wherein said amino acids comprise a set of sites 
which when mutated increase the stability of the protein; " 
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(3) identifying an amino acid mutation which decreases the 
volume of said internal cavities; (4, determining i, sald 
amino acid in said target protein can he changed without 
creating unaccepted steric interaction, ; (5 , replacing 
t*e amino acid in said target protein by site dLctel 
mutagenesis of the gene which expresses said target 
protein. 9 1 

167. The computer based method of claim 166 wherein said 
target protein is Bacillus lentus os M s<83 alLl"e 



protease 



L5 



168. The computer based method of claim 166 wherein said 
reference protein is any protein for which a three 
dimensional structure is available which is homologous to 
the target protein. 

169. The computer based method of claim lee wherein said 
reference protein is thermitase. 

170. The computer based method of claim lee „ h « eln 
reference protein is subtilisin Carlsberg. 

171. The computer based method of claim 166 wherein said 
reference protein is subtilisin BPM-. 

172. The computer based method of claim lee wherein said 
reference protein is proteinase K. 

orYorT**" tMed Mth0d in =«-^ the st^ility 

oLL rising the steps of: ,„ generating a 

probe-accessible surface of said target protein by probing 

molecule having a radius of about 0.9 to about 2 0 A 
wherein said probe-accessible surface has an 
surface the interior of which contains one or more^roT.- 
accessible internal cavities, (2) Alignin3 8aia 

SU33TJT UTS SKElT 
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dimensional coordinates of sald target 
reference protein by Moving the thr« dT„ ° 
coordinates of said reference protein i^ the c^w 
frame of said target protein; p, identifying °° r,ilnate 
- said reference protein vhos side chat ST" 0 ,."?" 
eaid solvent-accessible surface of said ™ ° 
said internal cavities o, said tara!f " ^ 
(4, identifying the amino acid in said targ e 7" efT" 
occupies the eguivalent position as «i d ~ " ! 
reference protein; (5, determining i, said L!° * 
said target protein ,.„„ >. ' J lno aoid i" 

unacceptable stertc effects (6) T 
in said target protein"'^ Li ! — » 

directed mutagenesis of the gene llich * " te ~ 

target protein. 1Ch «P«sse S said 

174. The computer based method of claim 17 3 „k • 
target protein is Bacillus ientul M »TVLT 
protease. 483 alkaline 



175. The computer based method of claim 173 web- 
reference protein i« wherein said 

protein is any protein for which » - 
dimensional structure is available which is hi t 
the target protein. homologous to 

176. The computer based method of claim 173 „ her • 
reference protein is thermitase. Said 

177. The computer based method of claim 173 „h . 
reference protein is subtili sin <J^. " 

178. The computer based method of claim i„ „ . 
reference protein is subtilisin BP N ' ^ 

179. The computer based method of claim „k • 
reference protein is proteinase K Said 



SUBSTITUTE SHEET 



.9221760A1_L> 



WO92/21760 



PCT/US92/04306 



1/18 
FIGURE 1 



1 


GLY 


C 


27.985 


27.065 


1 


GLY 


o 


26.834 


26.692 


1 


CLY 


H 


27.785 


25.660 


1 


OLY 


CA 


28.517 


26.825 


2 


GUI 


H 


28.745 


27.565 


2 


GLH 


CA 


28.205 


27.868 


2 


GLN 


CB 


29.179 


27.265 


2 


GLH 


CO 


28.905 


27.589 




GLN 


CD 


29.834 


26.805 




GLH 


OKI 


29.476 


25.685 




GLH 


HB2 


31.008 


27.317 


2 


GUI 


c 


28.045 


29.384 


2 


GLH 


o 


28.927 


30.159 






If 


26.940 


29.781 


■ 3 


9fiK 


ex 


26.568 


31.160 




ero 

oEK 


wo 


25.036 

A *r . WWW 


31.390 




5EK 




94 576 


30.913 




SSR 


r* 
\~ 


56 815 


31.424 


■» 


ofin 


r\ 
w 


26.464 


30.580 




VAX* 


fl 


07 371 


32.570 




VAX* 


CA 


37 531 


32 913 


4 


VAX* 


CB 


9fl AAA 
20 . GOV 


33. 625 


4 


VAL* 


Uii 


94 HOfl 
«7 . wo 


33.965 


4 


VAL 




JV * vwo 


39 739 


4 


VA1* 


c 


OA 347 




4 


VAL* 


O 


2o • J44 


J^ • J»W 




PRO 


N 


29 . JO* 


33 171 
J J . ^ * * 


5 


PRO 


CD 


2? . X4U 


39 114 


5 


PRO 


CA 


1A 111 
24 • JX J 


31 343 

J4 • J7J 


5 


PRO 


CB 


2 J - 4U4 


33 52 A 


5 


PRO 


CG 


2 J .629 


39 11A 
J 2 . XXV 


5 


PRO 


C 


24 • 82 J 


JO . O / / 


5 


PRO 


o 


25 # 816 


JSi DUX 


6 


TRP 


H 


24 • 12b 


JO . OU4 


6 


TRP 


CA 


24 . 97 / 


3fl O70 


6 


TRP 


CB 


•5 ^ COB 

23 • 589 


J7 . 2 J X 


6 


TRP 


CG 


22 . 3X3 


J 7 . JOU 


6 


TRP 


CD 2 


22 • 2 Jo 


4U« WOW 


6 


TRP 






34 S72 


6 


TRP 


CEO 


2 J . W7X 


lO fi71 


6 


TRP 


CSX 


21. X2U 


3R 755 
JO. / 9 •> 


6 TRP 


Iff « 


5fl 77j1 


34 OB9 


6 


TRP 




20 lfi£ 
* w • ^os 


40* 456 


6 


TRP 


CU 




41.455 


6 


TRP 




91 339 


41. 249 


6 


TRP 


c 


24.859 


38.028 


6 


TRP 


O 


25.812 


38.610 


7 


CLY 


M 


24.056 


37.299 


7 


CLY 


CA 


24.171 


37-250 


7 


CLY 


C 


25.488 


36.591 


7 


GLY 


o 


26.135 


36.993 


8 


ILK 


H 


25.911 


35.557 


8 


XLS 


CA 


27.125 


34.611 


8 


XLS 


CB 


27.250 


33.554 


8 


XLS 


CG2 


28.525 


32.760 


6 


XLS 


CGI 


26.016 


32.625 


8 


XLS 


CD 


25.683 


32.107 


8 


XLS 


C 


28.303 


35.772 



9.455 



8 ILX 0 29.238 35.790 21.181 

9 SSR N 28.225 36.591 19.264 
9 SXR CA 29.270 37.572 19.075 
9 SKA CB 29.158 38.161 17.652 
9 SXR 00 29.411 37.107 16.718 
9 SSR C 29.191 38.684 20.145 
9 SXR 0 30.236 39.113 20.660 

10 AM H 27.977 39.085 20.540 

10 ARG CA 27.775 40.132 21.537 

10 ARC CB 26.288 40.423 21.686 

10 ARG CG 25.946 41.656 22.562 

10AM CD 26.666 42.953 22.101 

10 ARG FX 26.378 43.300 20.705 

10 ARG Ct 25.394 44.138 20.338 

10 ARG RH1 25.226 44.365 19.048 

10 ARG NE2 24.604 44.767 21.215 

10 ARC C 28.351 39.782 22.893 

10 ARG O 28.942 40.673 23.476 

11 VAL H 26.222 38.532 23.377 
11 VAL CA 28.662 36.186 24.642 
11 VAL CB 28.127 37.003 25.339 
11 VAL CGI 26.664 37.416 25.538 
11 VAL CG2 28.227 35.723 24.530 
11 VAL C 30.343 37.832 24.471 

11 VAL O 31.021 37.393 25.404 

12 GLN N 30.868 37.944 23.261 
12 GLN CA 32.286 37.745 22.957 
12 GLN CB 33.129 36.763 23.772 
12 GLN CG 32.773 40.196 23.319 
12 GLN CD 33.643 41.252 23.997 
12 GLN 0Z1 34.842 41.403 23.753 
12 GLN NX2 33.145 42.035 24.926 
12 GLN C 32.806 36.330 23.186 

12 GLN O 33.978 36.104 23.557 

13 ALA N 31.938 35.350 22.940 
13 ALA CA 32.333 33.978 23.095 
13 ALA CB 31.189 33.004 22.890 
13 ALA C 33.416 33.589 22.084 

13 ALA O 34.293 32.789 22.477 

14 PRO N 33.507 34.053 20.808 
14 PRO CD 32.522 34.799 20.020 
14 PRO CA 34.622 33.646 19.943 
14 PRO CB 34.311 34.283 18.601 
14 FRO CG 32.806 34.270 18.606 
14 PRO C 35.977 34.034 20.525 

14 PRO O 36.900 33.216 20.393 

15 ALA N 36.096 35.170 21.257 
15 ALA CA 37.383 35.545 21.881 
IS ALA CB 37.253 36.887 22.612 
15 ALA C 37.837 34.470 22.892 

15 ALA O 39.024 34.129 22.980 

16 ALA R 36.899 33.826 23.591 
16 ALA CA 37.248 32.758 24.508 
16 ALA CB 36.057 32.436 25.368 
16 ALA C 37.632 31.505 23.705 

16 ALA O 36.587 30.787 24.026 

17 HIS N 36.927 31.180 22.610 
17 HIS CA 37.206 29.941 21.872 
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15.899 
15.678 
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14.173 
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21.405 
20.382 
21.602 
20.709 
21.562 
22.099 
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155 CI.Y N 
155 CLY CA 
155 Git C 

155 CLY O 

156 ALA N 
156 ALA CA 

' 156 ALA CB 
156 ALA C 

156 ALA O 
IS? SER N 

157 SER CA 
157 SER CB 
157 SER OG 



7.390 39.3»8 31.739 
I'lll 40 -2O6 31.915 
5.577 39.973 33.284 
!'?f? 40 -"8 34.319 
6.534 41.682 31.798 
f*!- 9 42.468 31.793 
7.805 42.092 31.773 
8.154 43.499 31.759 
••028 44.150 33.143 
45 ' 34 « 33.278 
7 '640 «.439 34.195 
7.476 44.065 35.498 
••649 43.170 36.405 

• 'lit f?/ 359 36 -"7 

J. 864 43.7S4 35.891 

8. 746 45.315 37.132 

9.857 45.747 37.932 

!*f!i 47 -150 38.402 

8.442 47.158 39.213 



157 SER C 10.08S 44.'g28 39 123 

III Hi 2 l S-Sl 45 - 2 " 4 ° :i " 

3*1 f5S 9.695 43.568 39.049 

iff S % 10 ' 126 <2 -«00 40 061 

If! S ? »S? 2:iS 

159 2? ??*f 01 40 ' 30 ° 38.272 

is! 13 ' 729 38.612 40.201 

i|| g* « 15.208 38.246 40 013 

iff 5 10.906 37.978 39.381 

110 «?? «f 10.806 36.974 38.510 

1 1 1 a sb m 

llS £ 10 * 947 34 -™7 39.691 

iSS ? 3 * :9 " 

161 TYR iS'fff 22* 738 40 ' 14 8 

161 w* ™ 10.867 32.645 40.876 

HI SJ S ISl I 2 - 231 «•»•■ 

161 TYR eS? 33 ' 315 43.030 

ifi ™ Si ifcSi IMS &S? 

! S i tg S:88 8:» 
iS £5 ? a ;-ffi S3S &K 

in p p i i s : a- as 2: « 

JSffi 2 iJ-525 31845 

lfit *Tm 14-352 32.580 39.814 

163 JSf ™ ff' 393 33 -S75 39.716 
ill °2 15 " S 34 43 6 38.441 
ill i£ £ ?! 538 34 *529 40.935 

i" w 2 i;-s; 34 -« 4 «.»» 

lL »J£ 14.417 34.878 41.559 

164 ARC CA 14.385 35.740 42.745 



164 ARO CB 12.934 , 

164 ARC CC 12 74? H'lll 43 * 071 

164 ARC S IJ.'jiJ JZ-; 8 J 

164 ARC HE 13 251 U'tS* 43 -906 

164 ARC CZ 14 foi 44 ' 988 

164 ARC mi iJ;2J5 39 '"° «.»01 

164 ARC NH2 15 289 Jo*??? J 6 * 838 

164 ARC C IS 033 ll'Ul 45 * 965 

164 ARC O IS S| J**" 3 43 ' 973 

165 TYR H is i|? 44 -807 
165 TYR CA if 787 JJ'fS! 44 • 046 
165 TYR CB Silt! Hinl 45 ' 176 
165 TYR CC iJ.oX JJ'fS 45 * 150 
165 TYR CD1 13 SI S'f22 45 ' 501 
165 TYR CBl S.oS I?* 807 4 6-600 
165 TYR CD2 1^ 2i* 484 4 6.88S 

;ii il i : H 

165 rr» o «•"» 

Jll iS «" J,'-i »:iS 

iniii 

as 1 a| If a:« 

169 MET CG 23 J?| ll'ZZl 38 ' 636 

If 1 If II P 

ill i m f S li 

iS i S •' 2:28 

■84 IS PH.. 

ins S I-'" 1 »":s ass 

173 ALA C 17*fil? ??"f?f 27 -"7 
w 17.667 41.631 27.812 
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FIGURE 1 



173 ALA O 16.987 42.599 28.169 
Hi J™ „ N 18 -«9 41.714 26,910 
Hi 12? W 1B ' 993 42944 2«-"8 

174 THR CB 20.504 43.281 26.349 

Hi IIP 001 21 -329 42.130 26.074 

" 4 ™* 002 20.733 43.727 27.767 

174 THR C 18.715 42.787 24.688 

174 THR O 18.622 41.662 24.148 

175 ASP N 18.674 43.934 24.029 
175 ASP CA 18.518 43.907 22.588 
175 ASP CB 17.388 44.840 22.148 
175 ASP OC 17.584 46.353 22.386 
175 ASP OD1 18.675 46.844 22.682 
175 ASP O02 16.579 47.047 22.291 
175 ASP C 19.794 44.2S8 21.834 

175 ASP O 20.844 44.480 22.440 

176 CLN N 19.724 44.498 20.516 
176 CLN CA 20.938 44.742 19.737 
176 CLN CB 20.702 44.722 18.237 
176 CLN CC 20.123 43.400 17.797 
176 CLN CD 18.592 43.272 17.887 
176 CLN 021 17.837 44.022 18.543 
176 CLN NE2 18.083 42.254 17.196 
176 CLN C 21.534 46.084 20.056 

176 CLN O 22.690 46.302 19.783 

177 ASN N 20.836 46.989 20.719 
177 ASN CA 21.382 48.288 21.098 
177 ASN CB 20.321 49.300 20.975 
177 ASN CC 19.832 49.550 19.587 
177 ASN 001 20.577 49.605 18.631 
177 ASN N02 18.526 49.678 19.484 
177 ASN C 21.895 48.299 22.521 

177 ASN O 22.380 49.322 23.026 
HI *f2 M 21.875 47.139 23.202 

178 ASN CA 22.256 47.033 24.623 
178 ASN CB 23.735 47.479 24.896 
178 ASN CG 24.734 46.515 24.314 
178 ASN O01 24.433 45.324 24.210 
178 ASN ND2 25.920 46.928 23.917 
178 ASN C 21.345 47.835 25.547 

178 ASN O 21.747 48.392 26.S76 

179 ASN N 20.081 47.806 25.174 
179 ASN CA 19.000 48.319 26.009 

HI £££ CB «•<>*« «.165 25.243 
179 ASN CG 18.566 50.593 25.088 

HI ^fS 2" 19 * 289 5i - 15 5 2 S.949 

179 ASN ND2 18.250 51.181 23.925 

Hi C 18 * 230 « 7 -"l 26.490 

HI JS! ° 18 - 2 « 46.016 25.872 

HZ °* "-734 46.241 28.230 

180 ARC CB 16.050 46.746 29.525 
HZ 05 15 * 289 «5.653 30.233 
HZ 00 46.201 31.492 
180 ARC NS 13.537 47.146 31.076 
HZ 55° Ct l2 -271 46.850 30.720 
180 ARC NH1 11.476 47.846 30.339 
HZ *£5 1,112 ll « 7 09 45.6S0 30.752 
HZ C "*«3» «S-909 27.213 
180 *?? ° K-991 46.855 26.715 
Hi H "-377 44.644 26.848 
Hi £2 °* 14 -225 44.338 26.002 
Hi ^ CB 14 366 « 2 -*83 25.663 
Hi ^ C 12 ' 9 « 44.677 26.771 
HI £J£ ° 12 - 87 3 44.495 28.009 
HI III * 45.172 26.133 
182 SER CA 10.757 45.650 26.927 



182 SIR CB 9.735 46.338 25 967 
182 m OC 9.316 45.472 24 'til 

IS S £:% : 
S «" ?:Bi HAll lisi 

183 PBS CDl 11.508 }• }}J J 6 *" 8 

183 PHZ C02 10. Ill lllUi ll'ffi 

183 PBS CZl 12.617 39 325 5i'i2? 

183 PBX CE2 U.7M ll'.lU 11 ill 

183 PHE CZ 12.774 39 6« fJ'JJf 

iS JS g *!:» nut H'B 

184 m n ii. Iff IIS™ U'lH 

?!i S J?' 998 42 • 030 30 04S 

iKS ^ it'll 6 41 - 420 30:947 

io« SER c 12.281 42.125 31 aji 
l"l£ 2 S'S? 

ss a j itm s:sj ss-ss 

185 CLN CB 11.034 40 489 34*JJ! 

ii 5 s cS s m?i 4o -*" 34:482 

185 GLN CO 8.912 39.174 35 is* 

185 CLN OS1 8.817 39 005 3 1*3 77 

HI ZH C 12 « 960 42.075. 34.773 

HI ™ ° 14 ' 066 41 '*°« 34 458 

HI ~" 12,871 43 • 046 35 676 

186 TJfR CA 14.048 43.618 36 L« 
186 TYR CB 14.488 44 924 It'lll 
186 m CG 13.385 45 992 ll'St 
186 TTR CDl 12.362 45.872 ll'tlt 
186 TTR CS1 11.347 46 805 U'tS 

iffS^S 2 13 * 38s 47: ° 49 ii:iS 

186 TTR CE2 12.386 47.988 36 396 

illZZ cz 11 - 378 47 - 8 « !!:«o 

°S X'* 1 * 48 - 846 35-328 
ill EE « ?M 3 ! 43 ' 925 37 - 8 « 

JI5 2? 2 H-fit 43 • 620 38 • 2 « 

1B7 GLT N 14.620 44.547 aa c 7 c 

187 CLY CA 14.330 44.111 ll'.Ul 

187 GLJT C 15.232 44.062 40 892 

HI™ s "- 3 « «. 5 J2 JS:!!! 

loo ALA V 14.762 43.915 42 

ill IS S il-ll 6 i 3 -" 2 

188 ALA CB 14.891 43.435 44.515 

ill 2 JS-HJ 41 • 065 42 • 549 

189 GLT N 17.263 41.594 42 9fi£ 

iS£ °£ il-lil 40 -» 3 

~I C 18 - 29 » 39.938 41.358 

}2 ?^ 0 18 * 911 38 ' 873 <1.139 

190 LEU R 18.128 40.857 40 397 
190 LEO CA 18.646 40.601 39 Ofil 

^ 00 18 • 302 «i-«54 36.607 

190 LEO CDl 17.688 40.163 36 140 
JgW ".844 42."l It'All 

So LS S fS-Xf 40 - 671 

ill 2 41 * 624 39 '589 

*} ~* ^ K 20.847 39.677 38.505 

191 ASP CA 22.285 39.597 38 558 
H" .V? CB 22.732 38 16^ ?f*« 8 
191 ASP OG 22.421 S.lfc ioIlM 
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191 ASP ODl 
191 ASP 002 
191 ASP c 

191 ASP O 

192 ILE N 
192 ILE CA 
192 ILE CB 
192 ILE CG2 
192 ILE CGI 
192 ILE CD 
192 ILE C 

192 ILE O 

193 VAL N 
193 VAL CA 
193 VAL CB 
193 VAL CGI 
193 VAL CC2 
193 VAL C 

193 VAL O 

194 ALA N 
194 ALA CA 
194 ALA CB 
194 ALA C 

194 ALA O 

195 PRO N 
195 PRO CD 
19S PRO CA 
195 PRO CB 
195 PRO CG 
195 PRO C 

195 PRO O 

196 GLY N 
196 CL3T CA 
196 GLY c 

196 GLY O 

197 VAL N 
197 VAL CA 
197 VAL CB 
197 VAL CGI 
197 VAL CG2 
197 VAL C 

197 VAL o 

198 ASN N 
198 ASN CA 
198 ASK CB 
198 ASN CG 
198 ASN ODl 
198 ASN ND2 
198 ASN C 

198 ASN o 

199 VAL N 
199 VAL CA 
199 VAL CB 
199 VAL CGI 
199 VAL OG2 
199 VAL C 

199 VAL O 

200 GLN N 
200 GLN CA 
200 GLN CB 
20O GLN CC 
20O GLN CD 
200 GLN OBI 
200 GLN NE2 
200 GLN c 

200 GLN O 

201 SER K 
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22.903 
21.685 
23.037 
24.122 
22.464 
23.192 
24.291 
23.628 
25.513 
26.686 
22.176 
21.020 
22.644 
21,847 
21.246 
20.189 
22.326 
22.653 
23.885 
21.891 
22.453 
22.770 
21.446 
20.264 
21.872 
23.294 
21.018 
21.899 
23.321 
19.802 
19.931 
16.648 
17.403 
16.401 
15.214 
16.829 
15.888 
15.690 
14.919 
15.038 
16.483 
17.672 
15.627 
15.957 
16.220 
15.814 
15.010 
16.255 
17.160 
18.147 
17.039 
18.096 
16.135 
19.303 
16.493 
17.672 
16.912 
18.706 
18.771 
19.584 
19.819 
20.240 
21.324 
19.502 
19.433 
20.567 
16.810 



36.250 41.148 
16.717 40.309 
40.095 37.355 
40.674 37.449 
1^.842 36.171 
40.070 34.908 
18.919 34.852 
17.619 34.325 
19.314 34.012 
38.323 34.226 
40.008 33.774 
39.545 33.967 
40.477 32.608 
40.379 31.392 
41.745 30.945 
42.187 31.967 
42.772 30.755 
39.820 30.203 
39.799 30.174 
39.376 29.204 
38.810 28.000 
37.303 26.253 
38.965 26.837 
39.273 27.044 
38.794 25.576 
38.583 25.188 
38.880 24.377 
38.465 23.180 
38.854 23.643 
38.002 24.479 
36.816 24.761 
38.574 24.192 
37.633 24.257 
38.217 23.175 
37.925 23.303 
38.890 22.088 
39.285 21.035 
40.877 21.010 
41-323 19.738 
41.327 22.327 
38.785 19.727 
38.697 19.432 
38.173 18.937 
37.626 17.630 
38.703 16.520 
38.095 15.160 
37.149 15.093 
38.621 14.013 
36.718 17.695 
36.910 16.978 
35.746 18.605 
34.791 18.849 
34.490 20.377 
33.623 20.702 
35.732 21.205 
33.522 18.017 
32.776 18.194 
33.324 17.005 
32.144 16.138 
32.515 14.906 
31.348 13.964 
31.677 12.544 
32.176 12.338 
31.494 11.476 
30.946 16.796 
31.114 17.277 
29.769 26.795 



201 SER 
201 SER 
201 SER 
201 SER 

201 SER 

202 THR 
202 THR 
202 THR 



202 THR OGl 
202 THR CG2 
202 THR C 

202 THR o 

203 TYR N 
203 TYR CA 
203 TYR CB 
203 TYR CG 
203 TYR CD1 
203 TYR CE1 
203 TYR CD 2 

203 TYR CE2 

203 TYR CE 

203 TYR 

203 TYR 

203 TYR 

204 PRO 
204 PRO 
204 PRO 
204 PRO 
204 PRO 
204 PRO 

204 PRO 

205 GLY 
205 GLY 
205 GLY 

205 GLY 

206 SER 
206 SER 
206 SER 
206 SER 
206 SER 

206 SER 

207 THR 
207 THR 
207 THR 
207 THR OGl 
207 THR CC2 
207 THR C 

207 THR o 

208 TYR N 
208 TYR CA 
208 TYR CB 
208 TYR CG 
208 TYR CD1 
208 TYR CE1 
208 TYR GD2 
208 TYR CE2 
208 TYR CZ 
208 TYR OH 
208 TYR c 

208 TYR O 

209 ALA N 
209 ALA CA 
209 ALA CB 
209 ALA C 

209 ALA O 

210 SER N 
210 SER CA 



CA 19.476 
CB 19.283 
OC 20.069 
C 18.875 
O 18.062 
W 19.318 
CA 16.879 
CB 19.769 
19.869 
21.204 
17.412 
16.901 
16.712 
15.266 
14.508 
13.165 
13.129 
11.918 
11.996 
10.770 
10.757 
9.560 
14.941 
15.656 
13.905 
13.057 
13.468 
12.176 
12.414 
13.249 
12.965 
13.473 
13.358 
14.643 
14.632 
15.770 
17.067 
17.523 
17.461 
17.098 
17.591 
16.566 
16.518 
15.070 
15.190 
13.924 
16.928 
17.600 
16.632 
17.071 
18.333 
19.364 
19.428 
20.274 
20.152 
20.978 
21.039 
21.935 
15.936 
15.224 
15.728 
14.653 
13.489 
15.041 
16.021 
14.378 
14.567 



OH 
C 

o 

N 

CD 
CA 
CB 
CG 
C 

o 

N 
CA 
C 

o 

N 
CA 
CB 
OC 

c 
o 

N 
CA 
CB 



28.564 
28.528 
27.563 
27.346 
27,446 
26.189 
24.880 
23.760 
24.043 
23.712 
24.706 
25.159 
23.986 
23.728 
24.620 
25.140 
25.884 
26.223 
24.708 
25.044 
25.798 
26.166 
22.322 
21.779 
21.662 
22.111 
20.319 
20.026 
20.819 
20.306 
21.337 
19.119 
16.927 
19.310 
19.630 
19.252 
19.586 
18.417 
17.216 
20.931 
21.045 
21.968 
23.294 
23.516 
24.695 
23.606 
24.275 
23.908 
25.546 
26.693 
27.321 
26.245 
25.565 
24.513 
25.869 
24.825 
24.151 
23.103 
27.689 
27.863 
28.316 
29.234 
28.364 
30.312 
30.178 
31.450 
32.642 



IBSTITUTE SHEET 



17.356 
18.691 
19.530 
16.701 
15.779 
17.171 
16.747 
17.461 
16.866 
16.688 
17.082 
18.115 
16.227 
16.398 
16.615 
16.239 
17.421 
17.992 
15.619 
16.193 
17.369 
17.949 
15.901 
15.040 
16.450 
17.596 
15.960 
16.797 
18.098 
14.463 
13.825 
13.895 
12.435 
11-724 
10.535 
12.442 
11.924 
11.036 
11.797 
11.175 
10.047 
11.642 
11.258 
10.667 
9.866 
11.700 
12.354 
13.342 
12.113 
12.914 
12.307 
12.061 
10.842 
10.646 
13.110 
12.917 
11.713 
11.601 
12.911 
11.906 
14.076 
14. 266.. 
14.707 
15.266 
16.019 
15.069 
15. 914 
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FIGURE 1 



??S III £» 14 - 614 33.893 15.065 

ai2 !™ °° 15,788 33.756 14.342 

fl ° f™ C 13.456 32.819 16.920 

2,? ° 12,255 32.689 16.610 

ill JfU M 13,895 33 '° 79 18.152 

III HE 01 12 • 990 3 3-304 19.244 

HE 12,963 32 ' 089 20.118 

ill HE J?? 12 * 368 30,848 19535 

;Ji HE £?i 12,346 29 - 857 20 -«7 

HE CT2 10,940 31.056 19.033 

HE C 13,372 34503 20- "0 

Hi HE ° 14,547 34-927 20.110 

212 ASM N 12.439 35.024 20.912 

i\l £fS f 2,734 36,191 «.741 

iH ^2 2! J 1,883 37,403 21.413 

212 ASM CO 11.961 37.853 19.972 

212 ASN O01 12.979 38.246 19.415 
HI 1102 10,841 37 ' 797 ^9-283 
2« C 12354 35 ' 787 23.156 
2*? i?5 ° 11,336 35 - 119 23.350 

213 GLY M 13.070 36.197 24.217 

f" «* « 12.648 35.928 25.599 

2H 5 13 ' 834 35 • 974 26.520 

HI ££I ° 14 * 990 35.843 26.099 

HI IS " 13 ' 583 36.141 27.832 

HI IS 14.658 36.016 28.829 

HI IS ~? Ji- 204 36,523 30 '2« 

Iw IS ~i 12,998 35 ' 812 30.594 

Hi IS 002 14 ' 0 * 4 38.055 30.271 

«i IS C 15 - 128 34.527 28.894 

HI IS S 16,253 34 - 214 29.302 

f ?S N 1*'304 33.607 28.360 

21 | f** % 14 663 ".187 28.217 

HI III CB " * 425 31,449 27 « 69 « 

ill fS °° 12 « 324 31.235 28.S64 
Hi 55 f 15,860 31 - 981 27 -237 
HI 55 ° 18 - 588 30 -"3 27 .305 
HI 51 16,039 32 • 907 2 6' 272 

HI 51 °* 17,165 32 « 901 25.324 
51 2 16,776 33 - 575 24 -oss 
ill 51 00 15,843 32 - 791 «.121 
111 51 52 14,133 32 ' 519 23.660 
III 51 018 14,311 30 - 783 23.925 
i?f SI C 18,372 33 - 638 25.885 
HI 51 ° 19,506 33 * 386 2S.460 
HI £H « ?!* 136 34,558 26.845 
Hi ?H ~ J 9,249 35 * 257 27 ' 4 «5 
iJ? 2H ^ Jf'Z 39 36,485 28.240 
ill ?H £ 19,991 34,343 28 - 432 
?iZ J££ ° 21,223 34 ' 249 28.386 
ill IS J 9,211 33 - 574 29 -" 9 

III IS J 9 * 756 32,657 30 -231 

iJf IS ^5? 18 - 587 31.860 30.888 
III IS 17 ' 719 32.837 31.429 

III IS °° 2 l 9 '° 4 0 30.887 31.979 
III IS C 20 -824 31.704 29.700 
III IS ° 21 - 912 31.648 30.275 
III 52 J! 20,683 31 * 008 28 .S86 
III S2 5? i 9 * 479 30 ' 843 27 ' 793 
HI »»2 S ? K708 30 - 099 28.089 
III »S2 68 21 ' 074 2 9-384 26.909 
III 00 19 - 9 « 30.268 26.471 

c 23,027 30 - 765 27 - 7 04 
ill b?« 2 f 4,060 30,108 27 • 745 

11% »l* K 2 2.994 32.0S1 27.345 
222 S 15 24 - 23 * 32.770 27.094 

220 BIS CB 23.997 34.219 26.600 
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220 HIS CO 23.307 34.345 25 237 

220 HIS C02 24.010 34.501 24 oil 

220 RZS HD1 21.999 34.359 IJ'SJf 

220 HIS Cll 21.849 34 Slfl ji'L, 

fi?2r I f N " 23:064 34 ' 6 " 7 ai.'ul 

Hi ill S III" 32,824 28 . 41 0 
11* .«? ° 26 - 27 6 32.626 28.383 

HI ™ Si 25.084 32.989 30.830 

221 VAX CB 24.180 33.727 31 22a 

ii, 1 ^ 001 24,746 33.6?4 ll:ii 3 7 

221 Y*f CO 2 24.119 35.194 31.366 

221 VAX. C 25.477 31.606 31 111 

221 VAL O 26.612 31 424 3?'??! 

222 ALA N 24.617 30 614 S'Hi 

2 2 2 2 2 2 j£ 2 ill" 1 29:223 31:421 

III ^ C ? 23 -871 28.283 31.032 

ill i25 S I5:ii? "-a? 

223 s » »«.»• i9 8 :i 1 8 2 J 2% 1 :f?J 
III ^ 28,928 28 - 849 

ff; ?H C 28 ' 7 " 29.661 29.070 

223 GLY O 29.806 29.064 29 098 

224 ALA N 28.557 30.955 29 357 
224 ALA CA 29.708 31.677 29 'III 

iiii^ c l It- 313 33:105 3o:SS 

Hi itf S ??*5fi 31 - 051 31.14? 

224 ALA O 31.463 30.894 31 

225 ALA N 29.387 £.580 H'ofi 
225 ALA CA 29.771 29.836 33*22? 

Ill Si c l ll-ll° 29,321 34: °2o 

III i£J S 1?'1U 28,603 32 .864 

225 ALA O 31.630 28.374 33 487 

226 ALA ti 30.248 27.816 ll 'ill 

226 ALA CA 31.033 26.664 li'lil 

SI Si c l S-lll 25 - 958 3 ° : * 8 ° 

HI ^ « !?* 446 27 ' 078 31.054 
III ^ 0 33 ' 4 21 26.381 31.370 

227 LBO H 32.587 28.209 30 328 

ill ^ 33,888 28 - 734 

227 LEO CB 33.691 29.983 28 oss 

227 LEO CG 32.901 29 762 Hill 

227 LEO CD1 32.1ii fl.'oil U'ttt 

227 LEO CD2 33.598 28.?oi It'.UI 

227 LBO C 34.782 29.060 31 088 

SIS. 2 11 lit 28,623 3i: "? 

Ill ^ " 34 ' 17 8 29.711 32.105 

228 VAL CA 34.951 30.076 33 
228 VAL CB 34.114 31.094 IJ'?!^ 
228 VAL CC1 34.822 31.451 35 50I 
228 VAL CC2 33.950 32.402 33 IS 
228 VAL C 35.340 28.SJ u'lS 

228 VAL O 36.468 28.777 li'.ilt 

229 LYS V 34.502 27.781 34 III 
229 LYS CA 34.817 26.566 34 865 

ill"! « 33 - 57 f 25J7 6 ; |4 4 :S?I 
St St S 3 3 4 3 :i% 5 § ?4 4 *4 3 ?9 4 JM2 

229 LYS CE 34.230 M.W7 ll'Ul 

229 LYS HZ 34.394 23.211 ll'lii 

SI g! S SM2 25 - 792 34 .*"o 8 
«!S S 3 3 Mi° Hi 33 34,841 

230 CLH CA 37.001 24 957 S'lll 
230 CLH CB 36.692 24,852 ll'lll 
230 CLH CC 37.819 iJ ffl JS'fJJ 
230 CLK CD 37.806 24 J« II.'JJS 
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230 OLN OKI 36.941 24.907 27.731 
230 OLN NE2 36.866 23,779 27.864 
230 CLN c 38.324 25.710 32.453 

230 CLN O 39.365 25,106 32.722 

231 LYS N 38.320 27.043 32.369 
231 LYS CA 39.482 27.877 32.678 
231 LYS CB 39.085 29.347 32.389 
231 LYS CC 40.041 30.518 32.637 
231 LYS CD 41.380 30.478 31.945 
231 LYS CE 42.078 31.872 31.997 
231 LYS NZ 42.377 32.352 33-343 
231 LYS C 39.970 27.715 34.142 

231 LYS O 41.173 27.658 34.409 

232 ASN N 39.023 27.635 35.097 
232 ASN CA 39.292 27.588 36.520 
232 ASN CB 38.801 28.648 37.227 
232 ASN CC 39.339 30.115 36.617 
232 ASN OD1 40.486 30.464 36.859 
232 ASN ND2 38.537 30.834 35.845 
232 ASN C 38.595 26.402 37.158 

232 ASN O 37.635 26.555 37.907 

233 PRO N 39.057 25.173 36.945 
233 PRO CD 40.245 24.847 36.150 
233 PRO CA 38.320 23.978 37.376 
233 PRO CB 39.053 22.819 36.729 
233 PRO CC 40.441 23.367 36.519 
233 PRO C 38.155 23.820 38.863 

233 PRO O 37.266 23.094 39.274 

234 SER N 38.962 24.489 39.675 
234 SER CA 38.725 24.374 41.124 
234 SER CB 40.005 24.643 41.961 
234 SER OG 40.378 26.007 41.847 
234 SER C 37.635 25.309 41.680 

234 SER O 37.203 25.124 42.824 

235 TRP n 37.151 26.270 40.878 

III IP °* 36 - 213 27 ' 246 *l-393 

2« IP CB 3 *-°*2 28.366 40.435 

III IP 00 37 -l« 29.323 40.391 

235 TRP CD2 37.103 30.539 39.761 

235 .TRP CE2 38.384 31.011 39.929 

235 TRP CE3 36.167 31.261 39.083 

III IP 001 3 «-«05 29.059 40.930 

235 TRP HE1 39.136 30.109 40.623 

235 TRP C22 38.726 32.237 39.404 

235 TRP C23 36.502 32.474 38.559 

235 TRP CH2 37.775 32.956 38.720 
III IP C 34 -*« 26.643 41.637 
til IP ° 34 - 427 25.726 40.941 

236 SER N 34.206 27.137 42.669 
236 SER CA 32.884 26.712 43.011 
236 SER CB 32.771 26.915 44.541 
236 SER OG 32.691 28.301 44.902 
236 SER C 31.891 27.549 42.200 

236 SER O 32.195 28.606 41.637 

237 ASN N 30.645 27.084 42.276 
237 ASN CA 29.495 27.743 41.705 
237 ASN CB 28.255 26.923 42.112 
237 ASN CC 27.966 26.679 43.605 
237 ASN OD1 28.706 27.112 44.495 
237 ASN ND2 26.851 26.017 43.928 

237 ASN c 29.388 29.219 42.117 
III ° 29 - 25 * 30.109 41.266 

III Yr^ N 29 - 592 29 - 555 «•«« 

238 VAX* CA 29.576 30.945 43.876 
238 VAL CB 29.553 30.919 45.442 
~5 VA ^ wl 23.767 32.294 46.097 
238 VAL CC2 28.199 30.344 45.605 



238 VAL 
236 VAL 

239 CLN 
239 CLN 
239 CLN 
239 CLN 
239 GLN 
239 OLN OE1 
239 OLN NE2 



C 
O 
N 
CA 
CB 
CC 
CD 



239 OLN 

239 CLN 

240 ILE 
240 ILE 
240 ILE 
240 ILE CG2 
240 ILE CC1 
240 ILE 
240 ILE 

240 ILE 

241 ARC 
241 ARG 
241 ARG 
241 ARC 
241 ARC 
241 ARC 
241 ARC 
241 ARC NH1 
241 ARC NH2 
241 ARC 

241 ARG 

242 ASN 
242 ASN 
242 ASN 
242 ASN „ 
242 ASN OD1 
242 ASN ND2 
242 ASN 

242 ASN 

243 HIS 
243 HIS 
243 HIS 
243 HIS 
243 HIS CD2 
243 HIS ND1 
243 BIS CE1 
243 BIS NE2 



C 
O 
N 
CA 
CB 



CD 
C 

o 

N 
CA 
CB 
CC 
CD 
NE 
CZ 



C 

o 

N 
CA 
CB 

CC 



C 

o 

N 
CA 
CB 
CG 



C 

o 

N 
CA 
CB 
CG 



243 BIS 

243 HIS 

244 LEU 
244 LED 
244 LED 
244 LEO 
244 LED CD1 
244 LEU CD 2 
244 LEU 

244 LEU 

245 LYS 
245 LYS 
245 LYS 
245 LYS 
245 LYS 
245 LYS 
245 LYS 
245 LYS 

245 LYS 

246 at:: 

246 ASH 



C 

o 

N 
CA 
CB 
CG 
CD 
CS 
NZ 

C 

o 

K 
CA 



30.741 
30.564 
31.903 
33.058 
34.348 
34.787 
36.001 
35.946 
37.174 
32.811 
33.124 
32.261 
31.950 
31.410 
31.025 
32.503 
32.041 
30.902 
31.087 
29.819 
28.769 
27.701 
26.634 
25.462 
24.364 
23.323 
22.325 
23.252 
29.313 
29.037 
30.153 
30.649 
31.391 
30.386 
29.177 
30.877 
31.591 
31.631 
32.330 
33.284 
34.183 
35.409 
36.367 
35.770 
36.908 
37.250 
32.559 
32.988 
31.473 
30.709 
29.576 
29.971 
28.719 
30.649 
30.147 
30.189 
29.690 
29.178 
28.452 
27.193 
26.536 
25.811 
25.221 
30.300 
30.125 
21.4C2 
32.S79 



31.770 
32.955 
31.146 
31.865 
31.007 



3C.771 



2*. 847 
2 £.629 
30.441 
32. 264 
3 j » 398 
31.291 
31.500 
30.186 
30.399 
29.161 
27.775 
32.564 
33.511 
32.484 
33.495 
33.092 
34.192 
33.692 
34.639 
34.340 
35.215 
33.149 
34.923 
35.874 
35.073 
36.413 
36.455 
36.371 
36.652 
35.681 
36.931 
38.152 
36.012 
36*451 
35.327 
35.790 
36.638 
35.447 
36.044 
36.757 
36.966 
37.984 
36.265 
36.649 
35.636 
34.234 
33.367 
34.360 
38.007 
38.853 
38.289 
39.632 
39.593 
38.687 
38.412 
39.677 
39.607 
40.665 
41.805 
•:0.279 
41.194 
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43.322 
42.971 
43.181 
42.654 
42.712 
44.165 
44.293 
44.354 
44.326 
41.203 
40.784 
40.463 
39.047 
38.368 
36.876 
38.463 
37.973 
36.696 
38.104 
39.667 
39.638 
40.655 
40.895 
41.771 
41.945 
42.749 
42.920 
43.371 
39.937 
39.200 
40.959 
41.277 
42.609 
43.746 
43.659 
44.877 
40.225 
39.938 
39.584 
38.593 
38.178 
37.413 
37.902 
36.181 
35.892 
36.945 
37.370 
36.820 
36.963 
35.801 
35.501 
34.956 
34.841 
33.602 
36.104 
35.217 
37.328 
37.654 
38.993 
38.928 
40.289 
40.573 
41.886 
37.714 
37.257 

:c.i?s 

38.352 
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249 
249 
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250 
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253 
253 
253 
253 
253 
253 
254 
254 
254 
254 
254 
254 
254 
255 
255 
255 
255 
255 
255 
255 
255 
256 
256 
2S6 



ASN 
ASN 
ASN 
ASN 
ASN 
ASN 
THK 
TKR 
THR 
TKR 
THR 
THR 
THR 
ALA 
ALA 
ALA 
ALA 
ALA 
THR 
THR 
THR 
THR 
THR 
THR 
THR 
SER 
SER 
SER 
SER 
SSR 
SER 
LEU 
LEU 
LEU 
LEU 
LEU 
LED 
LEU 
LEU 

gly 

GLY 
GLY 
GLY 
SER 
SER 
SSR 
SER 
SER 
SER 
THR 
THR 
THR 
THR 
THR 
THR 
THR 
ASN 
ASN 
ASN 
ASN 
ASN 
ASN 
ASN 
ASN 
LEU 
LEU 
LEU 



CB 
CO 
OD1 
ND2 
C 



o 

N 
CA 
CB 
OG1 
CG2 
C 

o 

N 
CA 
CB 
C 

o 

N 
CA 
CB 
OG1 
CG2 
C 

o 

N 
CA 
CB 

OG 
C 

o 

N 
CA 
CB 
CG 
CD1 
CD2 
C 

o 

N 
CA 
C 

o 

N 
CA 
CB 
OG 
C 

o 

N 
CA 
CB 
OG1 
CG2 
C 

o 

N 
CA 
CB 
CG 
OD1 
ND2 
C 

o 

N 
CA 
CB 



33.697 
33.286 
32.445 
33.814 
33.188 
33.819 
33.020 
33.574 
34.386 
33.492 
35.606 
32.516 
32.865 
31.252 
30.213 
26.829 
30.385 
30.961 
29.950 
30.001 
29.955 
31.151 
29.830 
26.630 
27.664 
29.067 
27.941 
28.405 
27.267 
27.136 
27.687 
25.824 
24.949 
24.067 
24.737 
23.663 
25.595 
24.069 
23.214 
24.239 
23.317 
22.B80 
23.651 
21.614 
20.958 
19.470 
18.813 
21.195 
20.900 
21.694 
21.773 
22.417 
23.694 
22.671 
20.311 
20.041 
19.316 
17.930 
17.061 
15.600 
15.191 
14.771 
17.441 
16.752 
17.889 
17.437 
17.435 



40.568 
40.502 
41.245 
39.538 
41.620 
42.686 
40.781 
41.115 
39.916 
38.818 
39.565 
41.547 
41.792 
41.714 
42.085 
41.914 
43.558 
44.395 
43.949 
45.323 
45.301 
44.706 
46.690 
46.105 
45.760 
47.214 
47.994 
49.102 
49.862 
48.631 
49.164 
48.523 
49.115 
48.019 
46.908 
46.020 
47.430 
50.231 
50.787 
50.606 
51.538 
50.976 
50.372 
51.241 
50.918 
51.165 
51.273 
49.492 
48.587 
49.321 
46.021 
48.071 
48.691 
46.670 
47.594 
46.419 
48.480 
48.038 
49.253 
48.927 
48.158 
49.580 
47.296 
46.279 
47.805 
47.297 
48.386 



39.196 
40.651 
41.165 
41.37S 
37.046 
36.960 
36.033 
34.734 
34.179 
34.055 
35.059 
33.737 
32.575 
34.123 
33.162 
33.800 
32.731 
33.440 
31.551 
31.096 
29.552 
29.080 
28.965 
31.676 
31.425 
32.412 
32.947 
33.875 
34.279 
31.822 
30.857 
31.929 
30.934 
30.342 
29.627 
29.043 
28.481 
31.462 
30.769 
32.703 
33.279 
34.613 
35.376 
34.872 
36.106 
35.891 
37.150 
36.567 
35.786 
37.796 
38.431 
39.869 
39.803 
40.414 
38.557 
38.445 
38.694 
38.783 
39.031 
39.271 
40.157 
38.459 
37.526 
37.550 
36.3C3 
35.106 
34.041 



256 LEU CO 
256 LEU CD1 
256 LEU CD 2 
256 LEU C 

256 LEU 0 

257 TYR N 
257 TYR CA 
257 TYR CB 
257 TYR CG 
257 TYR CD1 
257 TYR CE1 
257 TYR C02 
257 TYR CE2 
257 TYR CX 
257 TYR 
257 TYR 

257 TYR 

258 GLY 
258 GLY 
258 GLY 

258 GLY 

259 SER 
259 SER 
259 SER 
259 SER 
259 SER 

259 SER 

260 GLY 
260 GLY 
260 GLY 

260 GLY 

261 LEU 
261 LEU 
261 LEU 
261 LEU 
261 LEU COl 
261 LEU CD 2 
261 LEU C 

261 LEU 

262 VAL 
262 VAL 
262 VAL „ 
262 VAL CGI 
262 VAL CC2 
262 VAL C 

262 VAL O 

263 ASN N 
263 ASN CA 
263 ASN CB 
263 ASN CG 
263 ASN OD1 
263 ASN ND2 
263 ASN 

263 ASN 

264 ALA 
264 ALA 
264 ALA 
264 ALA 

264 ALA 

265 GLU 
265 GLU 
265 GLU 
265 GLU 
265 GLU 
26S GLU OE1 
265 GLO OE2 
265 GLU C 



OH 
C 
O 
N 

CA 
C 

o 

N 
CA 
CB 

OG 
C 

o 

N 
CA 
C 

o 

N 
CA 
CB 
CG 



O 
N 
CA 
CB 



C 

o 

N 
CA 
CB 

C 

o 

N 

CA 

CB 
CG 
CO 



16.565 
16.919 
15.095 
16.264 
17.798 
19.566 
20.590 
21.606 
20.957 
20.349 
19.733 
20.951 
20.330 
19.731 
19.142 
21.424 
22.226 
21.305 
22.222 
23.630 
23.896 
24.511 
25.897 
26.747 
26.779 
26.153 
27.225 
25.225 
25.413 
25.476 
24.999 
26.036 
26.105 
26.274 
26.349 
25.064 
26.675 
27.234 
28.410 
26.851 
27.872 
27.227 
26.633 
28.305 
28.935 
26.661 
30.181 
31.271 
31.866 
33.072 
33.666 
33.498 
32.250 
33.119 
32.136 
32.947 
32.528 
34.404 
35.259 
34.752 
36.169 
36.346 
37.790 
38.470 
39.623 
37.835 
36.745 



49.634 
50.482 
49.321 
46.162 
45.300 
46.246 
45.390 
46.225 
47.106 
46.459 
47.179 
48.503 
49.219 
48.536 
49.131 
44.557 
43.739 
44.756 
44.130 
44.552 
45.710 
43.586 
43.856 
42.633 
42.518 
44.278 
44.656 
44.013 
44.431 
43.210 
42.106 
43.461 
42.461 
43.195 
42.381 
41.598 
43.372 
41.470 
41.842 
40.192 
39.161 
37.754 
37.448 
36.734 
39.300 
39.699 
39.070 
39.216 
40.599 
40.880 
40.009 
42.124 
38.066 
37.994 
37.126 
35.931 
34.857 
36.250 
35.517 
37.304 
37.625 
38.768 
39.302 
40.138 
39.854 
41.060 
38.057 



34.134 
32.887 
34.182 
34.546 
33.803 
34.688 
34.268 
33.447 
32.389 
31.337 
30.384 
32.449 
31.446 
30.426 
29.335 
35.226 
34.776 
36.542 
37.496 
37.201 
36.877 
37.273 
36.955 
37.239 
38.660 
3S.527 
35.285 
34.600 
33.222 
32.331 
32.672 
31.151 
30.087 
28.721 
27.424 
27.191 
26.282 
30.309 
30.426 
30.263 
30.432 
30.407 
29.036 
30.824 
29.331 
28.193 
29.700 
28.755 
28.993 
28.136 
27.502 
28.143 
28.945 
29.826 
28.030 
26.088 
27.080 
27.801 
28.331 
27.054 
26.884 
25.842 
25.597 
26.723 
27.100 
27.255 
28.227 
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IK IVt ° 37,766 37 '" 4 «••»» 

IK ^ « 36,098 39 '°2° 28.897 

IK ^ % 36,698 39 ' 53 * 30 -309 

266 ALA CB 35.9S9 40.800 30.534 

lit t^t C 36,677 38 ' 48S 31-228 
IK *^ 2 37,562 38 ' 418 ".099 

IK « 35,677 37,593 3i -»i 

IK ^ 35,566 36,560 32.179 

IK ^ CB 34,165 35 ' 963 3 2-078 

IK 0"? 2 36,616 35 ' 454 »'0I7 

IK ° 36 - fil l 3«.737 33.081 

IK 22 37,257 35,279 30 - 9 27 

IK J22 22 36,227 34,187 30 ' 7 51 

IK £22 ~? 37,888 33,276 29.515 

268 TBR OG1 37.799 34.092 28.362 

268 THR CC2 36.575 32.530 29.710 

IK S C 39 617 34 - 7 « 30.576 

IK 12? 2 J£' 534 33 ' 996 30 -2« 

IK 222 39,728 36 - 045 30 -801 

IK 222 22 41,008 36,690 30 '"° 

IK 222 22 40,656 38,156 3o - 83 » 
IK 222 22 4 H 24 39 - 000 30 - 47 2 

IK 222 S 4 i* 544 40,401 29.949 

IK iE? 22 ff'fi 1 40,930 29.432 

IK 222 »2? 43,324 42,136 29.787 

IK 222 22! 44,518 42 -«3 29.265 

269 ARC NH2 42.681 42.951 30.667 

ill £*° C 4 l- 8 <4 36.161 32.014 
269 ARC OT1 41.328 35.597 32.990 
A«C OT2 43.070 36.206 31.952 

III 22 27,629 24,423 S-Sj 

!!! 01 18 - 48 2 3S.001 42.551 

IJ! 2!° ° H2 35,625 "'277 36.682 

!!? 2!° ° H2 19 ' 773 36.339 42.049 

Hi 2!2 2 H H 28,438 25.352 47.303 

III 2!2 OH2 25 ' 023 30 -639 43.381 

Hi 2!2 ° H2 23,352 28.163 42.310 

ill 2!2 ° H2 21,594 3S - 893 38-729 

III 2!2 ° H2 22,058 31 -3" 19-688 

HI 2!2 ° H2 18 * 752 4S.063 40.645 

IS? 212 22! 18 ' 039 30 -216 23.iii 

III IIS 22! J 4 * 078 9 ' 380 32 -3S6 

III 2!2 OH2 15 ' 449 39.938 28.355 

IK So 22! ^- 927 25,605 30 - 47 « 

Hi 2!2 OH2 32.858 32.346 37.185 

HI 22° OH2 31.544 33.624 27.713 

IK 2!2 OH2 31.580 8.103 M.Si 

ISa 212 22! 4 2' 076 35854 34.697 

HI 2!° OH2 8 ' 591 11.660 25.062 

Ul 212 22! ? 4,301 29 - 140 35.265 

291 212 22! 2 0,440 24,492 4 3-369 

292 212 22! !5' 793 42 ' 916 26.272 
tit OH2 30.881 38.720 32.534 

III 212 222 29,323 24.894 I; Mi 

Hi 22° 082 3°-053 41.242 26.124 
295 B20 OH2 26.029 30.946 34 554 

III 212 ° H2 23,950 42 .«30 40 Mi 
ill 222 OH2 2 2.857 33.906 20.288 

III 212 221 2 !* 750 12,657 2o.«S 

ill 2!2 OB2 36.182 42.867 32.920 

12? 2!2 ° H2 20,509 35 • 549 " 195 

!2! 22° OH2 21 ' 065 4 3.«8 15 225 

121 2!2 ° H2 12 ' 353 41 -495 42 254 

!SJ 212 22! 1 !- 733 34 - 741 i4 -° 55 

zZi 2 2 22? 7,156 35456 31.880 

iol ZlZ Z2i 7,914 47,871 34 -*'o 

306 H20 OH2 5.154 42.915 39.674 
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307 H20 OK2 26.065 37.253 43 74i 

308 H20 OH2 11.945 45 684 JM™ 

309 H20 OH2 19.111 ll'Ml Jo'll? 

310 H20 OH2 38.430 41.954 Js 

lii SIS SSI il-Sl 29 873 16:866 

in si *g urn mil S'ln 

Hi B 22? a l:J gg g:ffi 
8? SIS S2I g;!S S : S if'i 

a S3 gaff p 

323 H20 OH2 22.023 36.663 14 105 

324 H20 OH2 26.324 19.922 21 Si 

325 H20 OH2 30.661 17 697 35 ill 

326 H20 OH2 8.433 17 '21 ll'lll 

327 H20 OB2 32.wl «: 7 8 f 3 it'Ul 

328 H20 OH2 32.606 20.038 14 '21! 

329 H20 OB2 27.918 17 370 ll'lU 

330 H20 0B2 17.445 14 oJJ |i'?!2 

331 H20 OH2 16.s" }J ?|J fj*" 9 

332 H20 OH2 15.380 i4.546 l!'!?? 

333 H20 OB2 12.129 I6.0JJ ll'lll 

334 H20 OB2 13.873 16 685 ll'lll 

335 B20 OH2 6.048 18 751 ll'Sl 

3 3 3 3 7 6 2ISSI {-Si 16 - 951 35:536 

3398 ii Saas* 
ass »iS&i f 

346 H2D OH2 12.5iS J?;? S | JS'S! 

347 H20 OH2 9.164 48.300 42 769 

348 H20 OH2 11.958 43 33a if ac? 

349 B20 OH2 n SS il.'JIf JJJI} 

350 H2OOH2 4.931 44. SM il.'S 

1IIS2 2K f* 403 36,291 34 8 « 

J52 B20 OB2 5.564 39 761 » , 

353 B20 OB2 8 066 29 304 ll"f<! 

401 B2G OB2 23.SI! "JSJ J 2 * 467 

402 H20 OB2 22.840 42.988 S'iS 

403 B20 OB2 24.648 47.6S3 S'Jif 

404 B2D OH2 22.1 5 i iJ J" UM\ 

J»°. 7 S ^.-S! JM» 

J?? S S:iS : 0 : 

411B2GOB2 15. m ii.'lfo K'KI 

412 B20 OB2 33.843 20.940 9 Iff 

413 B2C OS2 16.995 IS 196 2 li!7 
415B2O0B2 38.899 33.531 34*112- 

416 B20 OB2 17.892 19 864 II III ' 

417 B20 OH2 34.568 HMt ti'HZ 

419 B20 OB2 35. 62! IS ii! «*^° 

420 B20 OH2 O.20€ 12 42fl ll'lll 

421 H20 OS1 38.653 ll'Ul It'^l 

422 B20 0H2 27.5 2 I U'.iH jJ^'J} 
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423 H20 OH2 

425 H20 OH2 

426 K20 OH2 

427 K20 OH2 
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432 H20 OH2 

433 K20 OH2 

434 K20 OH2 

435 H20 OK2 

436 H20 OH2 

437 H20 OH2 

438 H20 OK2 

439 H20 OH2 

440 H20 OH2 

441 H20 OK2 

442 H20 OH2 

443 H20 OH2 

444 H20 OH2 

445 H20 OH2 

446 H20 OH2 

448 H20 OH2 

449 K20 OH2 

451 K20 OH2 

452 K20 OH2 

453 B20 OH2 

454 H20 OH2 

455 H20 OH2 

456 H20 OH2 

457 B20 OH2 

458 H20 OH2 

459 K20 OH2 

460 H20 OH2 

461 H20 OH2 

462 H20 OH2 

463 H20 OH2 

464 H20 OH2 

465 H20 OK2 

467 H20 OH2 

468 H20 OH2 

469 E20 OB2 

471 H20 OH2 

472 H20 OK2 

473 B20 OH2 

474 B20 OB2 

475 B20 OH2 

476 B20 OK2 

477 B20 CH2 

478 B20 OR2 

479 B20 OH2 
460 B20 OB2 

481 B20 OH2 

482 B20 OH2 

483 B20 OH2 

484 B20 OH2 

485 B20 OB2 

486 H20 OH2 

489 H20 OB2 

490 B20 OH2 

491 820 OK2 

492 B20 OK2 

493 B20 OH2 

494 S20 OKi 

495 B20 OH2 

496 B20 OH2 



33.375 
10.662 
28.400 
37.069 
35.149 
14.410 
34.593 
33.293 
16.935 
36.502 
30.888 
6.433 
23.735 
30.269 
6.916 
31.535 
21.133 
26.156 
20.961 
10.366 
15.664 
15.488 
6.523 
6.347 
20.408 
9.986 
34.820 
17.186 
12.491 
31.523 
12.628 
33.466 
19.599 
16.152 
12.458 
37.639 
9.651 
33.545 
9.256 
35.476 
23.365 
11.732 
30.073 
16.204 
27.601 
2.443 
33.485 
16.400 
34.584 
18.844 
17.595 
19.970 
29.931 
26.764 
24.923 
4.494 
25.927 
19.179 
33.544 
7.275 
18.187 
14.703 
14.414 
20.741 
3T - £C4 

25.506 



39.759 
51.211 
26.227 
31.271 
22.967 
35.423 
37.589 
43.729 
12.276 
36.642 
44.367 
14.502 
32.721 
39.336 
37.376 
45.230 
38.497 
30.548 
41.888 
9.353 
13.252 
35.603 
29.548 
42.537 
28.429 
37.579 
21.034 
30.632 
19.964 
29.927 
27.138 
44.288 
43.860 
29.460 
29.430 
14.764 
34.465 
17.795 
16.911 
39.839 
24.048 
35.837 
50.380 
22.887 
27.623 
14.804 
27.966 
18.715 
26.355 
26.392 
33.022 
49.821 
22.624 
29.952 
29.997 
34.569 
28.389 
31.050 
35.B59 
28.059 
52.286 
47.608 
29.083 
38.573 
22 352 
32.823 
21.376 



31.397 
37.076 
22.233 
18.172 
42.892 
17.549 
20.470 
30.636 
22.731 
39.753 
36.634 
42.412 
13.204 
42.632 
38.041 
24.294 
43.405 
26.735 
36.136 
42.909 
41.086 
22.544 
42.831 
28.354 
14.479 
24.768 
34.828 
13.537 
46.613 
11.890 
21.026 
34.479 
38.560 
52.727 
17.126 
37.217 
20.032 
26.313 
34.260 
21.547 
13.490 
17.577 
31.035 
7.809 
26.352 
32.338 
24.079 
49.507 
28.896 
36.213 
12.700 
15.851 
47.074 
13.997 
46.055 
48.325 
42.632 
19.865 
34.951 
36.209 
20.471 
24.076 
26.931 
12.784 
42.540 
30.485 
19.908 



498 H20 OH2 

499 H20 OH2 

500 K20 OK2 

501 H20 OK2 
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508 H20 OH2 
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510 B20 OH2 
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512 H20 OK2 
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518 H20 OK2 
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520 H20 OK2 
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14.394 
39.498 
20.574 
41.254 
18.615 
23.238 
11.027 
6.051 
20.329 
34.042 
18.600 
23.964 
14.955 
31,742 
13.014 
3.657 
8.348 
9.871 
18.301 
10.419 
11.150 
43.085 
20.416 
40.300 



49.504 

21.926 

46.516 

36.175 

23.589 

48.249 

27.025 

28.870 

51.097 

46.991 

14.484 

14.515 

20.395 

13.971 

49.698 

17.317 

35.692 

26.970 

41.737 

21.355 

32.989 

38.642 

57.764 

29.469 



27.666 

32.920 

27.909 

22.038 

42.251 

18.498 

49.749 

41.533 

40.041 

33.740 

12.899 

28.480 

22.995 

22.917 

46.176 

43.260 

23.895 

29.151 

20.959 

11.387 

33.268 

27.705 

27.758 

52.597 
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